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Abstract 

We review results concerning the 9 dependence of 4D SU(N) gauge theories and QCD, where 
9 is the coefficient of the CP-violating topological term in the Lagrangian. In particular, we 
discuss 9 dependence in the large- N limit. 

Most results have been obtained within the lattice formulation of the theory via numerical sim- 
ulations. We review results at zero and finite temperature. We show that the results support the 
scenario obtained by general large- N scaling arguments, and in particular the Witten-Veneziano 
mechanism to explain the U{1)a problem. We also compare with results obtained by other 
approaches, especially in the large-iV limit, where the issue has been also addressed using, for 
example, the AdS/CFT correspondence. 

We discuss issues related to 9 dependence in full QCD: the neutron electric dipole moment, the 
dependence of the topological susceptibility on the quark masses, the U(1)a symmetry breaking 
at finite temperature. 

We also review results in the 2D CP^ -1 model, which is an interesting theoretical laboratory 
to study issues related to topology. 

Finally, we discuss the main features of the two-point correlation function of the topological 
charge density. 
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1 Introduction 



Four-dimensional SU(N) gauge theories have a nontrivial dependence on the parameter 9 which appears in 
the Euclidean Lagrangian as [1| 

L» = \f; v {x)F- v {x) - ie-£^e^ p<r F« v {x)F a pa {x), (1.1) 

where 

is the topological charge density. 9 is a parameter of the strong interactions, which leads to the breaking of 
parity and time reversal. 

The spacetime integral of the topological charge density assumes integer values Q = J d x q(x) for any 
continuous classical field configuration with a finite action. Such configurations, satisfying the classical 
equations of motion, were first constructed explicitly over 30 years ago, starting with the instanton solu- 
tion [72,522]. Instantons have been widely employed in a semiclassical approach, in studying tunneling 
phenomena, in supersymmetric theories, in instanton gas and instanton liquid effective descriptions of QCD, 
etc. (See Refs. [148,285,472,481,494,557] for some books and reviews describing the uses of instantons in 
various contexts.) 

In the semiclassical picture, instantons give rise to barrier-penetration processes between different classical 
n-vacua, which are obtained from the perturbative vacuum through nontrivial gauge transformations carrying 
winding number n. The existence of such barrier-penetration processes implies that the true vacua are linear 
combinations of n-vacua. These 9- vacua are given by 

\9)=J2e m6 \n). (1.3) 

n 

In a functional integral formulation, (9-vacua are described by adding the 9 term in the Lagrangian, as in 
Eq. [[13]) . 

The most plausible explanation of how the solution of the so-called U(1)a problem can be compatible 
with the 1/N expansion (performed keeping g 2 N fixed [521]) requires a nontrivial 9 dependence of the 
ground-state energy density F(9) [531,549], 



exp[-KF(0)] = J [dA] exp I - J d 4 x£ e j , (1.4) 

where V is the space-time volume. 

Some of the first approaches to study 9 dependence have employed effective Lagrangians, see e.g. Refs. [51, 
219,462,541,551]. Attempts for a more quantitative assessment of this problem have focused largely on the 
lattice formulation of the theory, using Monte Carlo (MC) simulations. However, the complex nature of 
the 9 term in the Euclidean QCD Lagrangian prohibits a direct MC simulation at 9 ^ 0; instead, one can 
obtain information on the 9 dependence of physically relevant quantities, such as the ground state energy 
and the spectrum, by computing their expansion around 9 = 0; the coefficients of such an expansion can be 
determined from appropriate correlation functions with insertions of the topological charge density at 9 = 0. 
The numerical evidence for 9 dependence, obtained through MC simulations of the lattice formulation, 
appears quite robust. 

The presence of the 9 term has important phenomenological consequences, since it violates parity and 
time reversal symmetry. Experimental bounds on the 9 parameter in QCD are best obtained from the electric 
dipolc moment of the neutron [61,293], which leads to an unnaturally small value for 9, \9\ < 10 -10 . This 



1 The Euclidean Lagrangian ll.lt corresponds to 



in Minkowski space-time. 
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suggests the idea that there must be a mechanism responsible for suppressing the value of 9 in the context 
of QCD. 

The 9 dependence is particularly interesting in the large- N limit where the issue may also be addressed 
by other approaches, such as AdS/CFT correspondence applied to nonsupersymmetric and non conformal 
theories, see e.g. Refs. [12,129,214,289,290]. 

A particularly useful class of models, in which 9 dependence can be studied by various approaches, 
consists of the two-dimensional (2D) CP^" 1 models. These models share a number of properties with four- 
dimensional SU (N) gauge theories, such as asymptotic freedom and nontrivial topology. In addition, they 
afford us the possibility of a systematic l/N expansion which, combined with numerical simulations, leads 
to a precise quantitative description of the models' properties. 

This review is organized as follows: 

In Sec. [2] we discuss the U(1)a problem, and its explanation in the framework of the large- N limit based 
on the Witten-Veneziano mechanism, which requires a nonzero topological susceptibility \ = d 2 F(6)/d0 2 \e—o 
in the pure four-dimensional (4D) SU (N) gauge theory, and therefore a nontrivial 9 dependence. 

Section [3] is devoted to a discussion of the general features of the 9 dependence of the ground state energy 
F(9) and the spectrum of 4D SU (N) gauge theories. In particular, we consider their expansion in powers 
of 9 around 9 = 0, which provides information in the region of small 9 values. We discuss the 9 dependence 
in the large- N limit, where standard scaling arguments indicate that the relevant variable is 9 = 9/N . This 
is also interesting because the large- N limit has been addressed by other approaches, such as the AdS/CFT 
correspondence. Finally, we discuss the 9 term in QCD and chiral perturbation theory, where the 9 term 
is closely connected with complex masses of the quarks. Calculations within the effective chiral Lagrangian 
lead to several important relations among the topological susceptibility, and the quark mass and condensate. 

In Sec. H] we introduce the lattice formulation of 4D SU(N) gauge theory and QCD. In particular, we 
discuss issues related to the lattice regularization of fcrmion Dirac operators. Besides the standard Wilson 
and staggered fermions, we consider Dirac operators satisfying the so-called Ginsparg- Wilson relation which 
lead to lattice regularizations preserving chiral symmetry. We mention in particular the Neuberger overlap 
Dirac operator. Aside from an exact chiral symmetry, Ginsparg- Wilson Dirac operators satisfy an exact 
index theorem on the lattice, at finite lattice spacing, thus leading to a natural definition of topological 
charge. 

Section [5] reviews the lattice methods which have been employed to investigate the topological properties 
of 4D SU{N) gauge theories and QCD. We show how a topological charge density operator can be defined 
on the lattice, and its renormalization properties. We discuss the problems arising when correlators of 
topological charge density operators are integrated over all space-time, thus including coincident points, as 
is required in the computation of the topological susceptibility, and more generally of the 9 expansion of the 
ground state energy F(9). In particular, a naive lattice definition of the topological susceptibility is affected 
by power-divergent additive contributions, which become eventually dominant in the continuum limit. We 
describe some methods which have been used to address this problem when employing bosonic definitions, 
such as the so-called geometric, cooling/smearing, and heating methods. Then, we consider a fermionic 
definition of topological charge density which exploits the lattice index theorem of Dirac operators satisfying 
the Ginsparg- Wilson relation. This definition circumvents the problem of renormalization arising in bosonic 
approaches, leading to well-defined field-theoretical estimators for integrated correlators of topological charge 
density operators, even though at a much higher computational cost. We finally discuss the large- N solution 
of the U (1)a problem within lattice QCD formulations with Ginsparg- Wilson fermions, where one can derive, 
at finite lattice spacing, a relation analogous to the large- TV Witten-Veneziano formula relating the rf mass 
to topological susceptibility. 

In Sec. [5] we review the results obtained for the 9 dependence of the ground state energy and the spectrum 
in 4D SU(N) gauge theories. They mostly refer to the first few terms of the 9 expansion around 9 = 0, in 
particular to \ = d 2 F(9)/d9 2 \g = o, and have been obtained by MC simulations. We review results at zero 
and finite temperature. The results for N = 3 and larger values of N support the scenario obtained by 
general large- N scaling arguments, and in particular the Witten-Veneziano mechanism to explain the U(1)a 
problem. We also compare with results obtained by other approaches, especially in the large- N limit, where 
the issue has been also addressed using, for example, the AdS/CFT correspondence. 

In Sec. [7] we discuss issues related to the 9 dependence in full QCD, such as the neutron electric dipole 
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moment, the dependence of the topological susceptibility on the quark masses, and the U(1)a symmetry 
breaking at finite temperature and across the deconfinement transition. 

Section [5]is dedicated to the 2D CP^ -1 models, and, in particular, to their 9 dependence in the presence 
of a topological term analogous to the 9 term of 4D SU(N) gauge theories. We review analytical results 
obtained in the large- N limit, and numerical results by MC simulations of their lattice formulation. The 9 
dependence of 2D CP 1 ^^ 1 models appears analogous to that conjectured for the 4D SU(N) gauge theories, 
in particular when comparing their large- N limits. 

Section [5] is devoted to a discussion of the peculiar features of the two-point correlation function G(x) 
of the topological charge density; in particular, it addresses the question of how a nonnegative topological 
susceptibility can arise from the integral of the correlation function G(x), i.e. \ — J d d xG(x) > 0, where 
G{x) < for \x\ > due to reflection positivity. Of course, this requires an important positive contribution 
from the contact term at x = 0, which compensates the negative integral for \x\ > 0. These issues are 
discussed within a solvable model, the large- N limit of the 2D CP N ^ X model, in the continuum and on the 
lattice. 

Finally, Sec. [TO] is devoted to the somewhat technical issue of the slow dynamics of topological modes 
in MC simulations; this gives rise to a dramatic critical slowing down, which poses serious limitations to 
numerical studies in the continuum limit. We discuss the implications for simulations of full QCD, and 
the question of whether one can obtain the interesting observables averaged over the full 9 vacuum from 
simulations trapped at a fixed topological sector. 

2 U(1)a problem and large- N limit 

2.1 The U(1) A problem 

The axial U(1)a problem is a long standing issue in QCD, which can be traced back to the early 1970's and 
before; it is intimately related to another, as yet unresolved, puzzle: the "Strong CP problem" . 
The QCD Lagrangian describing Nf quark flavors in Euclidean space is 

, N f 
C = -F^(x)F^(x) + £^ (£> + m f ) j>f ■ (2-1) 
/=i 

where D is the Dirac operator D = ^^(d^+gA^). In the chiral limit of massless quarks, assuming Nf > 1, the 
Lagrangian is invariant under rotations in flavor space, performed independently for left- and right-handed 
components, resulting in a global group of vector and axial symmetries: 

U(N f ) L ® U(N f ) R ~ SU(N f ) v ® SU(N f ) A <g> U{l) v <g> U{1) A . (2.2) 

Let us assume Nf = 3. In the real world, U(l)v remains intact, corresponding to baryon number conser- 
vation, and SU(3)v is only softly broken by differences in the quark masses, leading to the approximate 
symmetry of isospin and strangeness. The axial SU (3) a is spontaneously broken by the dynamical formation 
of nonzero quark condensates, leading to the octet of the lightest pseudoscalar mesons which, in the chiral 
limit, are presumed to be the massless Goldstone bosons corresponding to the eight generators of SU(3) A - 
As for the U(1) A symmetry, if it were intact in the chiral limit, one would expect all massless hadrons to 
have a partner of opposite parity. Since this does not appear to be the case, e.g. there are no scalar pions, 
one may next assume that U(1)a too, is spontaneouly broken. In this case, there should be a corresponding 
isosinglet pseudoscalar Goldstone boson. However, as originally estimated by Weinberg [539] using chiral 
perturbation theory, such a particle, away from the chiral limit, should have a mass less than \f3m-K ; the 
closest candidates are the mesons ?7(549) (already assigned to the 0~ octet) and 77' (985), whose masses are 
clearly beyond the Weinberg bound. 

The resolution of the U(1)a puzzle was proposed by 't Hooft [522,523], who showed that, as a result of 
instanton effects in the QCD vacuum, U(1) A is not a true symmetry of the theory. Under U(1)a transfor- 
mations: 

$ ^ g"* 75 V, (2.3) 
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the action of masslcss fcrmions is invariant, however the path integral measure over fermion fields is modi 
fied [252]: 

- i a g 2 N f f ]4 
32tt 



[#] [#] - exp ( J_ 2 ' / d 4 x e^F^ ) [#] [#] . (2.4) 



This noninvariance of the measure gives rise to the well known axial anomaly equation [3,66,73], which reads 

d^(x)=i2N f q(x) (2.5) 



in the chiral limit, where q(x) is the topological charge density defined in Eq. (|1.2|) . 

The integrand in Eq. (|2.4p is a total derivative; nevertheless the existence of nontrivial topological con- 
figurations, such as instantons, renders Eq. (|2.4p nontrivial. This is essentially related to the boundary 
conditions. If the naive boundary conditions = at spatial infinity are used, the integral in Eq. (|2.4p 
would not contribute and U(1)a would appear to be a symmetry again, 't Hooft showed [522,523] that 
the correct choice of the boundary conditions is that is a pure gauge field at spatial infinity, i.e. either 
A^ = or a gauge transformation of it. With these boundary conditions, there are gauge configurations for 
which J d 4 xq(x) 7^ 0. Despite being superficially a total divergence, the anomaly term has nonzero matrix 
elements at zero momentum. Thus, the U(1)a charge is not conserved, with the result that the theory 
contains neither a conserved U{\)a quantum number, nor an extra Goldstone boson. 

Comparing Eq. (|2.4p with the 9 term in the Lagrangian, —i9q(x), we see that a chiral rotation on massless 
fermions is equivalent to a shift in 0: 9^9 — 2aNf . Conversely, an initial 9 term could be rotated away in 
the presence of massless fermions; on the other hand, if the fermions have nonzero mass, as is the case with 
all quark species, such a chiral rotation affects their mass matrix. Allowing for complex quark masses, we 
may write the Lagrangian mass term in the form 

Nt 

^ _ y _ . 

Cm = 2^ mf ^ f ^ 1 + l5 ^ f+ 2^ m * f ^ f( - 1 ~ l5 ^ f ^^2^f( Rem f +ilmm f^)^f ■ (2.6) 
/ / f=i 

P and T symmetries are broken for Imm/ ^ 0. The transformation (|2.3|) modifies to/ as: 



to/ ->■ e 



'2 in 



to/. (2.7) 



Thus, we may rotate to/ to a real value by an appropriate choice of a (possibly different for each flavor). The 
shifted value of 9 after these rotations is still a source of P and T noninvariance; in particular, on dimensional 
grounds, it leads to an electric dipole moment for the neutron of order [541] 

dn „ e^f ~ 9^4- * 1O~ 16 e cm. (2.8) 
mi ml 

The experimental bound on d n is \d n \ < 2.9 x 10~ 26 ecm [61,293]. Supplementing this experimental bound 
with the rough estimate (|2 . 8[> . and also with further theoretical calculations indicating \d n /9\ ~ O(10~ 15 - 
10~ 16 ) ecm (which are reviewed in Sec. 17. ip . one then gets an exceedingly small value for 9: \9\ < 10 -10 . A 
similar bound has also been obtained by measuring the electric dipole moment of the 199 Hg atom [461]. We 
shall return to this issue in Sec. 17.11 

A number of possible explanations have been proposed for the origin of this small value of 9, such 
as spontaneous breaking of CP, vanishing of the mass of the lightest w-quark, and postulating the axion, 
which effectively promotes 9 into a dynamical variable (see, e.g., [443,509,541], for some relevant recent 
reviews). In particular, in the latter theory, originally proposed by Peccei and Quinn [444], the dynamical 
field replacing the 9 parameter could relax to a minimum of the effective potential where the parity and 
time reversal symmetries are recovered. This would also require the existence of a new particle [540,542], 
the axion. The various models using this basic mechanism that have been proposed to solve the CP problem 
have in common the existence of a further chiral symmetry [/(1)pq, which is spontaneously broken at high 
energies, much higher than the QCD scale, and is also broken by the anomaly. This explanation still 
represents an interesting open possibility, although the parameter regions related to the axion have been 
much restricted, see Refs. [443,541,555] and references therein. The vanishing of the mass m u of the lightest 
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w-quark has also been proposed as solution of the strong CP problem [343]. This possibility is disfavored 
by a standard current algebra analysis [264], which shows that the experimental data are consistent with 
a nonzero mass. Nonzero quark masses are also strongly supported by calculations within lattice gauge 
theory see e.g. Rcfs. [41,53,71,203,280]. For a recent review on quark masses see Ref. [555]. Moreover, 
in Refs. [155, 159] it has been argued that the vanishing of a single quark mass is renormalization scheme 
dependent due to nonperturbative effects related to the anomaly, therefore the vanishing of m u should not 
be relevant to a physical issue such as violation of the CP symmetry in strong interactions. 

Through the anomaly equation physically interesting matrix elements of hadron phenomenology are 
related to the topological properties of the theory. An example is given by the large mass of the rf , as we 
shall see later. Another example is related to the so-called proton spin problem, see e.g. Refs. [423,488,489, 
492,532]. 

2.2 The large- iV limit 

The large- N limit, where N is the number of colors, is a very useful framework to investigate the physics of 
strong interactions. In the context of particle physics the 1/A expansion was introduced by 't Hooft [521]. 
He proposed to generalize the SU(3) gauge symmetry of QCD to SU(N) and to expand in powers of 1/A. 
In fact, A is the only free parameter in QCD [550]. A is an intrinsically dimensionless parameter; the origin 
of A dependence is basically group-theoretical, and leads to well-defined field representations for all integer 
values, hence it is not subject to any kind of renormalization. The 't Hooft large- N limit is given by 

N -> oo keeping g 2 N, N f fixed. (2.9) 

The dominant Feynman graphs in the large- N limit can be classified by simply counting powers of N [147, 
521,550]. In general, the leading large- A contributions come from planar diagrams, each fermion loop 
introduces a factor 1/A, while each nonplanar crossing is suppressed by 1/N 2 . The Feynman diagrams 
contributing to connected correlation functions of gauge invariant operators constructed using gauge fields 
are 0(N 2 ). They are planar and do not contain fermion loops. The connected parts of correlation functions 
of fermionic currents are O(N). The corresponding diagrams are planar, they do not have internal fermion 
loops, and all current insertions are on a single fermion loop which bounds the graph. 

The most interesting feature of the large- AT expansion is that the phenomenology of QCD in the large- 
A limit presents remarkable analogies with that of the real world. Assuming that the N = oo theory 
confines, so that the physical spectrum consists of color singlets only, one can study properties of hadrons by 
applying power counting in N, and analyzing the intermediate states that contribute to the various n-point 
correlation functions, as discussed e.g. in Rcfs. [147,550]. This leads to a scenario which is qualitatively, and 
also semi-quantitatively, consistent with those of the real world (see for example Refs. [169,391,453,463] for 
reviews) . 

We also mention the Veneziano large- N limit [530] 

AT, AT/ -> oo keeping g 2 N,N f /N fixed. (2.10) 

This limit provides a better explanation of certain aspects of low-energy phenomenology. However, the 
't Hooft limit is simpler, and has been studied in more detail. 

We should finally note that the large- N solution of 4D SU(N) gauge theories is still unknown, and 
therefore it is not possible to perform a systematic 1/N expansion. The problems in determining the large- 
N saddle point are essentially related to the matrix nature of the theory, see for example Ref. [453] for a 
detailed discussion. Nevertheless, the 1/N expansion of QCD represents a very useful framework, within 
which several phenomenological issues can be discussed, and nontrivial predictions can be inferred, see e.g. 
Refs. [391,453]. We also mention that the 1/N expansion has been instead successfully developed in vector 
models, such as 0(A)-symmetric theories and CP^" 1 models (see e.g. Refs. [408,453,557]). 

The conjectured large- AT scenario of QCD (discussed at length in Polyakov's book [453]) can be inves- 
tigated by performing lattice calculations at relatively large A, studying their convergence for A — > oo in 
the 'quenched' case (i.e. with no dynamical fermions, given that contributions of quark loops should be 
depressed by a factor 1/N). 
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2.3 Solution of the U(1)a problem within the large-iV limit: The Witten-Veneziano 
formula 

In general a broken symmetry is best understood by studying the limit leading to a theory where the 
symmetry is conserved. In the case of ^(l)^ symmetry, such a limit should be that of a large number of 
colors: N — + oo. An explanation of the explicit breaking of the U(1)a symmetry based on large- TV arguments 
was originally proposed by Witten [549] and then refined by Veneziano [531]. According to Witten's argument 
the U(1)a problem should be solved at the lowest nonplanar level, i.e. at the next-to-leading order of its 
1/JV expansion. Let us sketch this argument. We introduce the topological susceptibility \ 



X 



3 2 F(9) 



de 2 



d 4 x(q(x)q(0)), (2.11) 



0=0 

as defined in Euclidean space. We recall that in Minkowski space-time the topological susceptibility reads 

X =-i f d i x(0\Tq(x)q(0)\0) (2.12) 

(more details on this definition and its relation with the corresponding Euclidean quantity can be found in 
Refs. [161,397,549]). 

While there is no 9 dependence in perturbation theory, there may be a nontrivial dependence at a 
nonpcrturbative level, and in particular within the 1/N expansion. Assume that the pure gauge theory 
without quarks is ^-dependent to leading order in 1/N, and therefore that x 7^ at N — oo. This assumption 
leads to an apparent paradox: on the one hand, when massless quarks are introduced the 9 dependence must 
disappear and therefore x — 0, on the other hand quark loops give only nonleading contributions of order 
1/N to the physical processes of the pure gauge theory [521]. This paradox can be solved by the presence of 
a particle with mass squared m 2 of order 1/N, having the same quantum numbers of the topological charge 
density. Such a particle should be related to the 77', i.e. the lightest flavor-singlet pseudoscalar in nature. 

The existence of a particle of mass squared 0(1/N) can be justified noting that in the large- N limit the 
singlet axial current should be conserved. The anomaly is an 0(1/ N) effect, because it arises from a diagram 
with one quark loop, which is depressed by a factor 1/N. Therefore at N = 00 there should be an axial 
singlet Goldstone boson. At 0(1/ N) the anomaly is recovered and, as a consequence, the N — 00 Goldstone 
boson gets a mass. At order 1/N, m 2 should receive an 0(1/ N) contribution, in that the mass squared of an 
approximate Goldstone boson is in general linear with respect to the symmetry breaking parameter, which 
in this context is represented by 1/N. 

So as a consequence of a nonzero large- N limit of x m the pure gauge theory, we would have m 2 ~ 1/N, 
while the other Goldstone bosons associated with the nonsinglet axial symmetry remain massless at the 
chiral limit. A further development of these ideas results in the leading order relationship 

f 2 m 2 

X=^^, (2.13) 
X AN f ' 17 

where f s is defined by 

(01^1*) = ^/.- (2-14) 



Notice that f s is of order V N and in the large- N limit f s = f ns = f v (where / w is defined as in Ref. [555]). 
By performing a more accurate analysis, based on the large-iV limit (|2.10p and using also the anomalous 
flavor-singlet Ward-Takahashi identities, Veneziano [531] refined the relationship (|2.13p obtaining 

^-X = rn*,+m*-2m 2 K . (2.15) 

Jit 

In Eqs. (|2. 13|) and (|2.15p . due to their large- N based derivation, x should be considered that of the pure 
gauge theory. This fact favors a check by Monte Carlo simulation of the lattice formulation, in that pure 
gauge simulations are much simpler than full QCD ones. Substituting Nf = 3 and the experimental values 
in (|2~T5)1 (i.e. [555] U » 131 MeV, m v > « 958 MeV, m, « 547 MeV, m K » 494 MeV) the prediction is 
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X ~ (180 MeV) . This prediction has been substantially verified by lattice computations, as we shall see 
below. 

We mention that the Witten-Veneziano formula has been also derived within the lattice formulation of 
QCD, when the so-called Ginsparg- Wilson lattice fermions are considered [279], see also Sec. 15.51 Moreover, 
it has also been investigated within the AdS/CFT framework in Refs. [50,65,309,348,465]. 

Models based on instanton semiclassical pictures can hardly explain the nontrivial large- N behavior of the 
topological susceptibility, x — 0(N°), and therefore of the rf mass, m 2 , ~ 1/N. The naive dilute-instanton 
picture would suggest that the topological susceptibility and m v > vanish exponentially as e~ cN , c being some 
constant, because the instanton weight behaves as exp(— Sir/g 2 ), and the 't Hooft large-iV limit must be 
taken keeping g 2 N fixed, thus leading to an apparent exponential suppression. Actually, since a nontrivial 
./V-dependent prefactor [523] appears in the weight of the instanton distribution, this argument strictly 
applies only to small instantons, leaving open the possibility to get a consistent scenario by appropriately 
taking into account instanton contributions at the QCD scale. The consistency of the semiclassical instanton 
calculation and 1/N expansion of QCD has been discussed in Refs. [468,469]; in particular it was argued 
that the instanton liquid model, where large (overlapping) instantons get suppressed by ad-hoc short-range 
repulsive instanton interactions [493], is not necessarily incompatible with the large- N scenario. This issue 
has been also discussed [171,339] within the 2D CP 1 ^^ 1 models, see also Sec. 18.31 



3 9 dependence of 4D SU(N) gauge theories 
3.1 General features 

In order to study the 9 dependence of the ground-state energy in 4D SU(N) gauge theories, it is convenient 
to introduce the dimensionless scaling function 

m . m^m, (3 ,) 

where F(6) is the ground-state energy defined in Eq. (|1.4j) . and a is the string tension at 9 = 0, which can 
be obtained from the area law of Wilson loops. Of course, one may use any other energy scale to define a 
dimensionless function related to the ground state energy, for example the lowest glueball mass. 

As a consequence of the topological nature of the 9 term when applied to continuous field configurations, 
which leads to the quantization of topological charge, the ground-state energy F(9), and therefore /(#), is 
expected to be periodic in 9, with periodicity 27r. 

The 9 dependence can be studied in the region of small 9 values by expanding f(9) around 9 = 0. 
Of course, we are assuming that the theory is not singular at 9 = 0, and in particular that CP is not 
spontaneously broken at 9 = [528]. For this purpose, the scaling function f(9) may be parametrized as 

f(9) = \c9 2 s(9) 1 (3.2) 

where s(9) is a dimensionless function of 9 such that s(0) = 1. C is the ratio x/& 2 an d X is the topological 
susceptibility at 9 = 0, 

X = jd i x{q(x) q (0)) = ^-, (3.3) 

where 

Q = J d 4 xq{x) (3.4) 

is the topological charge. The function s(9) can be expanded around 9 = as 

s(9) = 1 + b 2 9 2 + b A 6 A + • • • , (3.5) 

where only even powers of 9 appear. The coefficients of the expansion of f(9) are related to the zero- 
momentum rt-point connected correlation functions of the topological charge density, and therefore to the 



10 



moments of the probability distribution P{Q) of the topological charge Q. If all &2n were vanishing, leading 
to s(9) = 1, then the corresponding distribution P(Q) would be Gaussian, i.e. 

P(Q) = _L=exp (S^r) ■ ( 3 - 6 ) 



V 2(Q 

Therefore the coefficients b 2n of the expansion of s(9) parametrize the deviations from a simple Gaussian 
behavior. For example, the coefficients of the first few nontrivial terms are given by 



X4 

12 X ' 



(3.7) 



X4 = ^[(Q 4 }-3(Q 2 ) 2 ] e=0 , (3.8) 



and 



X6 

360x' 



(3.9) 



1 



Xe = y [(Q 6 ) ~ 15(Q 2 )(Q 4 ) +30(Q 2 ) 3 ] e=Q , (3.10) 

etc. It has been recently shown by Luscher [387] (see also [278]) that correlation functions involving multiple 
zero-momentum insertions of the topological charge density can be defined in a nonambiguous, regularization- 
independent way, and therefore the expansion coefficients b 2n are well defined renormalization-group invariant 
quantities. 

Besides the 9 dependence of the ground-state energy, one may also consider the 9 dependence of the 
glucball spectrum and of the string tension a (6), from the area law of Wilson loops. By analogy with the 
ground state energy, <j{6) may be expanded around 9 = as: 

a{9) = a (l + s 2 9 2 + ...), (3.11) 

where a is the string tension at 6 = 0. Similarly for the lowest glueball state: 

M{9)=M(l + g 2 9 2 + ...), (3.12) 

where M is the ++ glueball mass at 9 = 0. Of course, at 9 7^ 0, the lightest glueball state does not have 
a definite parity anymore, but it becomes a mixed state of ++ and h glueballs. The coefficients of the 
above expansions can be computed from appropriate correlators at 9 = 0, see also Sec. 16.51 

Another interesting related issue is the possible spontaneous breaking of CP at 9 = tt. SU(N) gauge 
theories, as well as QCD, in the presence of the 9 term are invariant under CP not only at 6 = 0, but 
also at 9 = tt. This is because 9 — it goes to 9 = —tt under a CP transformation, but physics should be 
unchanged for 9^9 + 2tt, so that the values 9 = ±ir are equivalent. However, CP can be spontaneosuly 
broken at 9 = tt, with the appearance of two CP violating degenerate vacua [170]. According to the Vafa- 
Witten theorem, [57,528] which has been argued to hold under quite general conditions (the argument makes 
positivity assumptions on the path integral), this possibility is excluded at 9 = 0. The spontaneous breaking 
of CP at 9 — tt is expected to be generally related to a first order transition when varying 9. These issues 
have been investigated in Refs. [14,33,152,219,237,340,369,496,524,551]. They have also been addressed 
within softly broken supersymmetric QCD, see e.g. Refs. [239,358], arriving at analogous conclusions for the 
dependence on 9 around 9 = tt. The 9 dependence in softly broken supersymmetric QCD-like theories has 
been also discussed in Refs. [210,246,359,361,483]. 

3.2 9 dependence in the large-iV limit 

Large- N scaling arguments [548,551,552] applied to the Lagrangian (jl.ip indicate that the relevant scaling 
variable in the large- N limit is 

B = ±. (3.13) 
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This can easily be derived by recalling that the Lagrangian is 0(N 2 ) and the large- N limit is performed 
keeping g 2 N fixed. Accordingly, the ground-state energy is expected to behave as 

f(e) = m - F( - 0) =N 2 f(9), (3-14) 

where the function f(x) has a nontrivial limit as N — » oo. 

In the dilute instanton gas approximation, the 9 dependence has the form 

-87r 2 /sV e - ( Kg* N)je/N s h 



e -m /g e w = ^-m , (g N) e »/» \ (3 15) 

in the one-instanton sector. This is exponentially suppressed in AT, thus it might suggest that the 9 de- 
pendence is exponentially small in N . This conclusion is incorrect, essentially because the instanton gas 
approximation is not valid due to infrared divergences. 

The large- AT behavior of the coefficients 62™ of the expansion (|3.5[) can then be easily derived: 



W) = ^CcoPil + b 2 9 2 + k9 4 + •••), (3.16) 
where C M is the large- N limit of the ratio C = x/a 2 . Comparing with Eq. (|3.2p , one obtains 



c = c «> + Jj2+--> (3.17) 



b 23 = %L + ... (3.18) 

We recall that a nonzero value of is essential to provide an explanation to the U(1)a problem in the 
large- N limit, and can be related to the rf mass [549] through the relation 

Xoo = f -^ + 0(l/N). (3.19) 

We note in passing that the quantity b 2 also lends itself to a physical interpretation, being related to the 
r]'-r)' elastic scattering amplitude [531]. 

The large- N scaling behavior is apparently incompatible with the periodicity condition f(9) — f(9 + 2w), 
which is a consequence of the quantization of the topological charge, as indicated by semiclassical arguments 
based on its geometrical meaning for continuous field configurations. Indeed a regular function of 9 = 9/N 
cannot be invariant for 9^9 + 2n, unless it is constant. A plausible way out has been proposed by Witten 
in Ref. [551]: the ground-state energy F{9) in the large- iV limit is a multibranched function because of many 
candidate vacuum states which all become stable, although not degenerate, for N = 00. Such behavior is 
exhibited by some two-dimensional models [146,549]. This scenario leads to the ground-state energy [551] 

F(9)=N 2 Min k H^±^y (3.20) 

where H is an unspecified function; F{9) is then periodic in 9, but not regular everywhere, since at some value 
of 9 there is a jump between two different branches. This issue, and in particular the consistency between 
the 9/N dependence in the large-A limit and the 2tt periodicity in 9, has been discussed in Refs. [247,292, 
313,314,436,551]. 

The conjecture (|3.20p was then refined in Ref. [552]. F(9) must have its absolute mininum at 9 = 
essentially because at 9 — the integrand of the Euclidean path integral is real and positive. If the vacuum 
is unique at 9 = 0, then the mininum must occur for k — 0. Moreover, the large-iV solution of the U(1)a 
problem requires d 2 F/d9 2 \e=o > in the large- iV limit and, therefore, we can write F(9) — F(0) oc 9 2 ; higher 
orders do not contribute to leading order in l/N. The simplest expression combining this behavior with 
periodicity is then given by 

F{9) - F(0) = A Miiife (9 + 2-Kkf + 0(1/N). (3.21) 
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In particular, for sufficiently small values of 9, i.e. \0\ < tt, 

F(6)-F{0)=A6 2 + O(l/N 2 ). (3.22) 

This issue has been discussed within a field-theoretical framework in Refs. [259,482]. The same result has 
been derived using the conjectured correspondence between large- N gauge theories and supergravity/string 
theory on some particular compactified spacetimes [552], and also by calculations based on the fivebrane of 
M theory to study the 9 dependence of softly broken supersymmetric SU(N) gauge theories [440]. As we 
shall see, this scenario is supported by the results of Monte Carlo simulations of the lattice formulation of 
4D SU (N) gauge theories, presented in Sec. [5] 

Concerning the spectrum of the theory, the large- N scaling arguments, which indicate 9 = 9/N as 
the relevant Lagrangian parameter in the large- N limit, imply that the coefficients of the 9 expansion in 
Eqs. (|3.11[) and (|3.12[) are suppressed, in particular S2 and gi should decrease as 1/N 2 . This is suggestive of 
a scenario in which the 9 dependence of the spectrum disappears in the large- N limit, at least for sufficiently 
small values of 9 around 9 = 0. The only effect of the 9 term on the lowest spin-zero glueball state is that it 
becomes a mixed state of ++ and h glueballs, as a consequence of the fact that the 9 term breaks parity 
but the mass of the state does not change. This conclusion has been also reached by an analysis of the 9 
dependence of the glueball spectrum using AdS/CFT [260]. This scenario is also supported by the results 
of lattice Monte Carlo simulations reported in Sec. 16.51 



3.3 9 term in QCD and effective chiral Lagrangians 

An important approach to the study of 9 dependence in QCD is provided by chiral perturbation theory, 
see e.g. Refs. [265,266], also [541] and references therein. The low-energy physics of Goldstone bosons in 
QCD can be described systematically using an effective theory: In the case of Nf light quarks, the chiral 
Lagrangian of the Goldstone boson SU(Nf) matrix field U in the lowest-order approximation is given by 

C ch = ^F 2 Tr[d M C/%[/] - l-STrlMeWNftf + M^e^^U], (3.23) 

where the coupling F is related to the pion decay constant, F = s» 92 MeV at lowest order of chiral 

perturbation theory and S is related to the quark condensate in the massless theory, 

E = JM. (3.24) 
N f 

M is the real diagonal quark mass matrix, M = diag(m„, rtid, ■■■)■ The kinetic term proportional to F 2 is 
invariant under chiral SU(Nf)i, ® SU(Nf)n transformations, while the mass term breaks chiral symmetry. 

Let us consider the simplest case of Nf degenerate flavors of mass m. In this case the chiral symmetry is 
broken to SU(Nf)v- From Eq. (|3.23p . one can derive an expression for the ground-state energy by minimizing 
the action, i.e. [551] 

F(6)-F(0)=NfmY±[l-cos(6/N f )], (3.25) 

which is valid for 16*1 < ir. When it is extended periodically to other values of 9, it shows cusps at 9 = 
2ir(k + 1/2) [551]. From Eq. (|3.25p one can derive the topological susceptibility 



d 2 F{9) 

X - 



d9 2 

An analogous expression can also be determined for different quark masses 



=o J 



x = s 



e— y 1 - 



Given that, by virtue of the anomaly, the flavor-singlet pseudoscalar particle rj' is heavy, it is not a 
Goldstone boson and, therefore, its physics cannot be described using chiral perturbation theory in the form 
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of Eq. (|3.23p . However, in the large- N limit the anomaly disappears and the rj' becomes a Goldstone boson, 
as discussed above; thus, it may be studied in chiral perturbation theory. To this end, the chiral Lagrangian 
is modified as [219,372,424,462,551] 

C ch = ij^tt^l/tfytf] - hn±[Me i9 ' N itf + M^e- l6/N W] - ~ X (lndett/) 2 , (3.28) 

where U is now a U(Nf) matrix, whose determinant is given by exp(iyj2N frf / F). The last term in this 
chiral Lagrangian gives the rj' mass m 2 , — 4Nfx/f%- At large N the pion decay constant f„ is 0(y/~N), while 
the topological susceptibility x of the quenched theory is 0(1). Therefore, at large N, m 2 , is suppressed by 
1/N, consistently with the rj' transforming into a Goldstone boson. Since 9 is related to the determinant of 
the quark mass matrix, by appropriately redefining the field associated with the rj 1 particle, one can rewrite 
Eq. (pQgj) as 

£ ch = ^F 2r Ti[d^d^U] - ~£Tr[MZ7+ + Aft 17] + ^ X (ilndetl/ - 9f . (3.29) 

This theory can be used to obtain some information on the 9 dependence of the spectrum at large TV, see 
e.g. Ref. [108]. Of course, the 9 dependence must disappear at leading order, and this is encoded in the 
above chiral Lagrangian because its last term is suppressed, as a consequence of the fact that the r.h.s. of 
the anomaly equation (|2.5p vanishes. 



4 Lattice formulation of the theory 

A nonperturbative formulation of QCD is essential for understanding the low-energy hadronic physics. In 
this respect the Euclidean lattice formulation originally proposed by Wilson represented an important break- 
through. Indeed, it provided an elegant nonperturbative formulation of QCD, emerging from the critical 
continuum limit of a statistical four-dimensional system. This opened the road to the use of the powerful 
numerical techniques of statistical mechanics, and in particular of Monte Carlo simulations. During the last 
few decades, the results of this approach and their agreement with experiments have been remarkable (see 
e.g. the recent Refs. [99,172,526] and Ref. [555] for reviews). For example, the hadron masses computed 
from lattice QCD with Nf = 2+1 flavors of dynamical quarks agree quite well with the observed spectrum 
in nature, as shown in Fig. [1] [526]. 

In statistical mechanics, continuum field theory may be seen as a tool in the study of critical phenom- 
ena [544,546,547], since the real world is represented by the lattice models, which for example may be 
associated to crystalline structures of solids. Lattices play an important role at short distance, but become 
irrelevant in the critical region, where they can be just seen as regulators of the field theory which describes 
the critical behavior. Conversely, in studying fundamental interactions one adopts the opposite point of view: 
The real world is represented by a continuum field theory, and a lattice may be introduced as a regulator 
for this theory, with the lattice spacing a as cutoff. The lattice system has no physical meaning and within 
a large class of universality its choice is not unique, but it can be studied at any temperature/coupling by 
exploiting the lattice techniques of statistical mechanics, so that one can get information about its critical 
region, which should be described by the initial continuum theory. This idea provides a nonperturbartive 
formulation of a quantum field theory, from the critical behavior of statistical models. 



4.1 Lattice formulation of SU(N) gauge theories 

A lattice formulation of 4D SU(N) gauge theories, preserving local gauge invariance, is given by the Wilson 
action [545] defined on a four-dimensional hypercubic lattice and constructed using link variables U^(x) € 
SU(N), defined for each link between neighboring sites x and x + afx (here, a is the lattice spacing, and 
pb= 1, 2, 3, 4 stands for a unit vector along one of the spacetime directions). The Wilson action is 

Sl = ~4 fl4 E ReTrLVOr), 0=™, (4.1) 

X,fJ,>f Ju 

where go is the bare coupling and 11^^ is the product of link variables along a 1 x 1 plaquette of the lattice, 

n„ v {x) = Up(x)U v (x + afl)Ul(x + av)Ul{x). (4.2) 
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Figure 1: Lattice results for the spectrum compared with the experimental values, from Monte Carlo 
simulations of QCD with Nf = 2 + 1 dynamical 0(a) improved Wilson quarks, taken from Ref. [526]. In 
each pair of data, the left (right) point has been expressed in GeV using as input the mass of (<j>). 



Formally, in the a — > limit, performed by setting = exp(ioA^), where = t" 'A® (t a are the group 
generators, normalized so that 2Tr t a t b = 5 ab ), and expanding in powers of the lattice spacing a, we recover 
the continuum action 

S = [ d 4 x^TTF^F^. (4.3) 

The statistical theory develops a mass gap, and therefore a length scale £. The continuum theory is 
nonperturbatively defined in the critical limit <7q — > 0, when £ — > oo in units of the lattice spacing. According 
to standard renormalization-group arguments, a physical length scale £ satisfies the equation 

£-(s + *<»>5;) «-<*■">■ <iA) 
where @l(9o) is the lattice /3-function, defined by 

PL(go) = a-j-go, (4.5) 
and the derivative is performed keeping the physical quantities fixed. A weak coupling expansion gives 

pL(g ) = -b gl-b l9 5 + ..., (4.6) 

where 

11 N , 34 ( N 



By solving Eq. I|4.4|) we obtain the behavior of the length scale as a function of the bare coupling g 

f = aexpj" g^L. (4.8) 
Replacing the perturbative expansion of the /3-function we obtain the asymptotic behavior 

« - « M)~" 6l/26g exp (J^) [1 + O( 5o 2 )] . (4.9) 
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Dimensionless and renormalization-group invariant physical quantities R, such as the ratio C = xl® 2 
introduced in the preceding section, approach their continuum limit R*, and therefore their physical value, 
with 0(a 2 ) scaling corrections, apart from logarithms of a. Therefore, in the limit go — > 0, we have 

R(gl) - R* ~ ^ . (4.10) 

The calculation of dimensionless and renormalization-group invariant quantities is then favored in that the 
scaling regime (|4.10|) should set in much before that of the asymptotic scaling formula (|4.9|) . whose corrections 
get suppressed only logarithmically, i.e. they are 0(l/ln£). 

The relevant continuum limit must be taken at infinite volume, therefore it must be reached keeping 
1 <C £/a <C L where L is the size of the lattice. At finite £, in order to get infinite volume results, one 
may perform calculations at large but finite volume (finite size scaling theory allows one to control finite 
size effects in the scaling region, see e.g. Ref. [128]), thus working with a finite, although large, number 
of variables. One can then perform numerical Monte Carlo simulations in order to obtain nonperturbative 
information close to criticality; extrapolations controlled by field theory (using Eqs. (|4.9[) and (|4.10[) ) lead 
to the desired physical numbers of the continuum theory. 

4.2 Fermions on the lattice 

The major difficulty in introducing fermions on the lattice is that one usually ends up with having more 
fermions in the continuum limit than intended. This is dubbed the "fermion doubling" problem; for example, 
the naive discretization of the Dirac operator leads to sixteen fermions in the continuum limit. The no-go 
theorem [435] of Nielsen and Ninomiya states that this is unavoidable under a few plausible assumptions. 
See, for example, Refs. [151,404] for general discussions of this issue. In the following we briefly discuss the 
various proposals that have been considered so far, up to the various versions of so-called Ginsparg- Wilson 
fermions, and in particular the overlap fermions, which have circumvented the no-go theorem, leading to 
a lattice regularization of fermions coupled vectorially to a gauge field which preserves chiral symmetry. 
Recent reviews discussing chiral symmetry on the lattice are found in Refs. [134,153,305,432,433,558]. 

4.2.1 Wilson fermions 

In Wilson's formulation [545] of lattice QCD, the desired number of fermions in the continuum limit is 
achieved by adding an appropriate irrelevant term to the naive lattice discretization of the Dirac operator. 
The Wilson action for QCD is given by 

Sl = -^J2 ReTVn^z) +J2M*)( D w + m 0J )V/(z)- (4.11) 

x,ix,u x,f 

Dw is the Wilson Dirac operator 

Av = \ W (V* + V„) - ™v;V M ] , (4.12) 

where 

Vrf>{x) = - [U(x, n)^{x + afi) - il>(x)] , (4.13) 

V^(a;) = - - U{x - afi, fj,)^(x - afi)] , 

r is the Wilson parameter and / is a flavor index. The r term breaks chiral symmetry, but it becomes 
an irrelevant operator in the continuum limit, where chiral symmetry should be recovered. Therefore, the 
price paid to have the correct number of fermions in the continuum limit is an explicit breaking of the 
chiral symmetry, which causes 0(a) lattice artefacts and other unwanted effects at finite lattice spacing. For 
example, the quark masses are affected by an additive renormalization, which shifts the chiral limit to a 
negative value of moj- Improved versions of the Wilson action are obtained by implementing the Symanzik 
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improvement procedure [510], using for example the so-called clover action [388,480], where an appropriate 
(fi dependent) tuning of an additional parameter in the action can achieve the cancellation of the 0(a) 
scaling corrections, leaving only 0(a 2 ) corrections. 

An alternative discretization of QCD is given by staggered fermions [357,507]. This is for example 
described in textbooks such as [404,464]. Staggered fermions achieve a partial reduction in the degrees 
of freedom of naive fermions by a so-called spin diagonalization. Moreover, they exhibit a residual lattice 
symmetry which protects the quark mass from additive renormalization, and scaling corrections are Ola 2 ). 
The major drawback of staggered fermions is that fermion doubling is only partially reduced, from sixteen 
to four species, but this does not reproduce QCD. Monte Carlo simulations with fewer that four fermion 
species have been performed by taking fractional powers of the fermionic determinant in the functional 
integral. While the results obtained so far appear quite promising, see e.g. Ref. [172], the full correctness of 
the corresponding continuum limit is still debated, see e.g. Refs. [158,362]. 



4.2.2 Ginsparg- Wilson fermions 

The conceptual problems related to chiral symmetry can be overcome if the lattice Dirac operator D satisfies 
the so-called Ginsparg- Wilson (GW) relation [273] which, in its simplest form, may be cast as 

ToD + D To = aD~f 5 D. (4.14) 

The GW relation implies the existence of an exact chiral symmetry of the lattice action under the transfor- 
mation [384] 

5if>(x) = e 7 5 (l - \aDj i/,(x), (4.15) 

5$(x) = ei>(x) ^1 - \ aD ^j 75- 

This symmetry protects the quark masses from additive renormalizations [304,427], and lattice effects are 
0(a 2 ) in the continuum limit. The associated chiral Ward identities ensure the nonrenormalization of vector 
and flavor nonsinglet axial vector currents, and the absence of mixing among operators in different chiral 
representations. Thus, the pattern of lattice renormalization becomes greatly simplified with respect to 
lattice fermions violating chiral symmetry [16, 126,304]. Lattice gauge theories with GW fermions have been 
proved to be renormalizable to all orders of perturbation theory [459] . 

Lattice Dirac operators satisfying Eq. (|4. 14[) are not affected by the Nielsen-Ninomiya theorem [435], 
thus they need not suffer from fermion doubling. Note however that, although the GW relation implies 
some generalized form of chirality on the lattice, it does not guarantee the absence of doublers. Solutions 
of the GW relation without doublers have been found, such as domain wall fermions [258,342,479], overlap 
formulations [411-413,427,428,430,432], and the so-called classical fixed-point Dirac operator [91,307,308]. 
A lattice formulation of QCD satisfying the Ginsparg- Wilson relation overcomes the complications of the 
standard approaches based on Wilson or staggered fermions, where chiral symmetry is violated at the scale 
of the lattice spacing. However, the numerical implementation of these chirality preserving Dirac operators 
is significantly more CPU intensive than the standard approaches. 

The GW relation and the associated symmetry (|4.15p imply relations at finite lattice spacing which are 
substantially equivalent to those holding in the low-energy phenomenology associated with chiral symmetry 
(see e.g. Refs. [131,353,433]). A natural candidate for the chiral condensate at finite lattice spacing is given 

by 

E L = -$(l-iaI>)v>. (4-16) 

which can be easily obtained by applying a nonsinglet transformation to the quantity ^2 x (tpt a "f5tp(x))- Sl 
is the order parameter which is expected to be nonzero in the thermodynamic limit, 

lim lim S L ^ 0, (4.17) 

m—*0 V-^oo 
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leading to the spontaneous breaking of chiral symmetry. As shown in Ref. [131], massless pions emerge 
when the lattice chiral symmetry is broken, due to the nonzero condensate (|4.17|) . This can be inferred by 
considering the zero-momentum pion correlation function 

Gab = (^(^(0)^75^(0) rp(x)t b ^(x)) (4.18) 

X 

in the presence of a mass term m. Using the GW relation, in the chiral limit one arrives at the relation [131] 

L^H-IodW (4.19) 



lim G ab = S ab 

m—>o crm \ I J 

Thus, a nonzero condensate implies that the r.h.s. is singular in the chiral limit. Since G ab ~ rn~ 2 , this 
leads to the well known relation ~ m. 

The axial anomaly then arises from the noninvariance of the fermion integral measure [384] under flavor- 
singlet chiral transformations (|4. 1 5[) 

5[d$dip] = eoTr(7 5 D)[##] = -2eN f {n+ - n-)[drpdip], (4.20) 

where n± are the number of zero modes of D which are also eigenstates of 75 with eigenvalues ±1, see also 
Refs. [1,139,253,385,508]. This implies that the flavor-singlet chiral transformations present an anomaly on 
the lattice like that in the continuum, and provide a natural fermionic definition qt{x) of topological charge 
density as [307] 

qi {x) = ^Tr[ 75 D(x,x)}, (4.21) 

and an index theorem on the lattice, i.e. 

Qi = &( X ) = indcx ( D )- ( 4 - 22 ) 

x 

We will return to this point in Sec. 15.41 Considerations on the flavor-singlet pion correlator [131] lead also 
to the relation (where all flavors are assumed degenerate) 

i((n + -0 2 ) = ^ (4.23) 



to leading order in the chiral limit. This is analogous to the continuum relation derived in Sec. 13 . 31 cf. 
Eq. (535) . 

The same result can be inferred [433] by introducing a 9 parameter as in the continuum. In the case of 
Nf degenerate quarks, we can write the lattice action as 

Se = Sgaugc + [^ D i> + P&b&l + $liI>r\ q(x), (4.24) 

X X 

where /j, is a complex mass and q(x) is given by Eq. (|4.21[) . When transforming the fermion fields as 
ipL e _m- 0ij one can restore the invariance of the action by changing /1 — > e la fi and the change of the 
measure can be cancelled by 9 — > 9 — NfOt. Therefore, the ground-state energy should only depend on the 
combination 

He i9 / N f (4.25) 

as in the continuum. By repeating the arguments of Ref. [373], leading to Eq. (|3.25l) . with the same natural 
assumptions, the ground-state energy density can be written as 

F{9) = ~N f Z L Re (V e/JV ') + 0(|/i| 2 ). (4.26) 

Then, by taking the second derivative with to respect to 9 for real fi = m, one obtains Eq. (|4.23D . 

Let us briefly mention the known examples of Dirac operators that satisfy the GW relation. One of them 
is the "fixed-point" Dirac operator, which is based on the idea of a classically perfect action obtained from 
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the fixed point of appropriate renormalization-group transformations as determined by classical saddle point 
equations [308]. At the classical level these actions do not have cutoff effects; however, 0(a 2 ) cutoff effects 
reappear at the quantum level. This idea has been applied to QCD in Refs. [91,180,183]. The corresponding 
Dirac operator satisfies the GW relation [304] . Domain wall fermions are based on the idea [342] that massless 
4D lattice fermions arise naturally as zero modes localized on a domain wall embedded in a 5D space-time. 
This allows to construct a lattice vector-like gauge theory preserving chiral symmetry. The five-dimensional 
fermion action to represent quarks in lattice QCD was proposed in Refs. [258,479]. Finally, there is the 
overlap Dirac operator proposed by Neuberger [427], which is described below. 



4.2.3 The Neuberger overlap Dirac operator 

The simplest lattice formulation satisfying the GW relation is Neuberger's overlap Dirac operator [427,428, 
430]; it has been derived from the overlap formulation of chiral fermions on the lattice [413], and can be 
shown to be an effective Dirac operator which describes the massless chiral mode of the domain wall fermion. 
It is given by 



1 + X(X^X)~ 1/2 ] , (4.27) 



a 

X = D w --p, (4.28) 
a 

where Z?w is the Wilson-Dirac operator (|4. 12[> . and p is a free real parameter whose value must lie within a 
certain range, in order to guarantee the correct pole structure of the fermion propagator; the perturbative 
limits of that range are < p < 2. The Fourier representation of £>n shows that no fermion doublers appear. 
Several perturbative studies have been performed with this fermion lattice formulation, see e.g. [16, 18,127, 
149,318,352]. 

As shown in Refs. [319,322], under some general assumptions which include the absence of doublers, 
GW fermions cannot be ultralocal, i.e. any lattice variable must be coupled to an infinite number of other 
variables. Indeed, the Neuberger-Dirac operator is not strictly local, and locality should be recovered 
only in a more general sense, i.e. allowing an exponential decay of the kernel of £>n a t a Tate which scales 
with the lattice spacing and not with the physical quantities. In Ref. [312] the locality of Dn has been proved 
for sufficiently smooth gauge fields. Moreover numerical evidence has been presented for typical gauge fields 
in present-day simulations. 

Unlike the Wilson-Dirac operator £>w, Dn is not analytic in the link variables when the operator X 
in Eq. (|4.27|) has a zero eigenvalue. However, such a lack of analyticity is expected to be harmless in 
the continuum limit [312,430]. The nonanalyticity of general GW fermions, and in particular of overlap 
fermions, is strictly related to the existence of the lattice index theorem. On the one hand, the lattice 
configuration space is simply connected when using conventional actions. On the other hand, the lattice 
index, corresponding to an integer winding number, must be singular as one passes from one sector to 
another [154]. The locations of these singularities actually depend on the particular GW Dirac operator. One 
way to circumvent these nonanalyticities is by putting constraints on the roughness of the gauge fields [312, 
431]. Note that this would also imply that transitions among different topological sectors become suppressed. 
Actions implementing this idea have been considered in Refs. [88,255]. As noted in Ref. [156], such constraints 
would lead to actions without a positive transfer matrix. 

In conclusion, the Neuberger overlap Dirac operator provides a lattice regularization of massless QCD, 
i.e. of chiral fermions coupled vectorially to a gauge field, without the need of fine tuning. We should also 
say that the numerical implementation of dynamical overlap fermions is still a great challenge today. Indeed 
their Monte Carlo simulations are considerably slower than simulations of Wilson or staggered fermions, 
see e.g. Refs. [42,186-188,233,250,417,429,430]. A review of Monte Carlo algorithms for simulations with 
overlap fermions is reported in Ref. [467]. 



5 Topology from the lattice 

The topological winding number, which classifies continuum 4D SU(N) gauge field configurations [523], relies 
on certain smoothness assumptions. This winding number is uniquely defined for smooth fields, however the 
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path integral requires integration over all configurations, some of which may not be sufficiently smooth. In 
particular, the lattice regularization makes the topology strictly trivial, because the configuration space of 
lattice fields is simply connected. It is only in the continuum limit that physical topological properties are 
expected to be recovered. 

5.1 The topological charge density on the lattice and its renormalization 

We first recall that, in pure gauge theories, the topological charge density q(x) is a renormalization-group 
invariant composite operator, i.e. its anomalous dimension is zero, when it is defined in an appropriate 
renormalization scheme, such as minimal subtraction MS [236]. One can straightforwardly define lattice 
operators corresponding to the topological charge density, by constructing a local function qh{x) of the 
lattice fields which has the topological charge density q{x) as its classical continuum limit: 

q L {x) ~^a 4 q(x)+0(a 6 ). (5.1) 

qh(x) is not unique, indeed infinitely many choices differing by 0(a 6 ) terms can be conceived. Various lattice 
versions qi,(x) have been introduced in Rcfs. [60,143,215,216,240,390]. A simple example is given by the 
twisted double plaquette operator [215] 

1 ±4 

Ql(x) = ~ 2 4 x 327r 2 zZ PV 11 ^] J ( 5 ' 2 ) 

where is the lxl plaquette operator defined in Eq. (|4.2p . One can easily check that, classically, 
Eq. (|5.1J) is satisfied. The classical continuum limit of lattice operators must be in general corrected by 
lattice renormalizations at the quantum level, see e.g. Ref. [97]. This also holds in the case of the topological 
charge density, thus we have [119] 

q L (x) — > a 4 Z L (gl)q(x) + 0(a 6 ), (5.3) 

where Z^g^) is a finite function of the bare coupling g^ going to one in the limit g^ — > 0, which can be proven 
using the fact that the continuum operator q(x) has zero anomalous dimension. The function Z^(go) can be 
computed in perturbation theory [119]; some perturbative calculations are reported in Refs. [39,119,120,143, 
208,495]. For example, for the operator (TO)) a one loop calculation [119] gives0Z L (gg) = 1-0.908^+0(5$) 
for SU(3). This straightforward approach allows us to compute general <i-dimcnsional Euclidean correlation 
functions of the topological charge density operator, 

G q (xi,x 2 , —,x n ) = (q(x 1 )q(x 2 )...q(x n )}, (5.4) 

by exploiting the field-theoretical relation 

G^(x 1 ,x 2 ,...x n ) = a~ nd Z L {g )- n {q L {x x )q L {x2)...q L {x n )) +0{a). (5.5) 

Analogous relations can be written down for correlation functions with insertions of other operators at 
different space-time positions. However, it is important to stress that this holds only when Xi ^ Xj for any 
i 7^ j. Indeed, the coincidence of two spacetime arguments gives rise to peculiar contact terms, which make 
the relation between lattice and continuum correlation functions complicated, as we shall see later. 

The above relations become more intricate in the presence of fcrmions, essentially because the topological 
charge density is not renormalization-group invariant anymore; instead, it presents a mixing with fcrmionic 
operators at the quantum level. Unlike pure gauge theory, in full QCD the topological charge density 
mixes under renormalization with d^j^. The nonrenormalization property of the anomaly in the MS scheme 
means that the anomaly equation should assume exactly the same form in terms of bare and renormalized 
quantities [4]. However, the renormalization of d^j^x) and q{x) is nontrivial. In the chiral limit the 
Euclidean anomaly equation reads 

d fl jf t (x)=i2N f q(x). (5.6) 



2 The lattice topological charge density is renormalized in such a way that its renormalized correlation functions coincide 
with those obtained in the MS renormalization scheme. 
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Rcnormalizcd operators in MS are obtained by [4,66,236,367,491,519] 

i2N f q(x)\ (I z-l\ fi2N f q(x) 

d »3» )r ~ V° * A d »3» 



(5.7) 



where 



e = 2 — d/2 and c F = (N 2 — 1)/(2N). The structure of the renormalization matrix above assures the 
stability of the anomaly equation under renormalization. As a consequence of Eq. (|5.7|l . the topological 
charge density has an anomalous dimension, and therefore the relation between the generic matrix elements 
of any lattice version of the topological charge density and its continuum counterpart cannot be as simple 
as in Eq. ()5.5|) . Using general renormalization-group arguments, one can show that [37] 

(a\i2N f q L \b) = Y(g 2 ) (a\R\b) , (5.9) 

where a, b are generic states, Y(g 2 l ) is a finite function of g^, which can be computed in perturbation theory, 
and 

(a\R\b) = (a\d^(x) m \b)e X1? [ j^dg (5.10) 

is a renormalization-group invariant quantity; d^j^ {x)j^g indicates the operator d^j^x) renormalized in the 
MS scheme, and the function 7(5) is related to the anomalous dimension of the continuum operators q(x), 
dfj_j^(x) in the MS scheme: 

d , 1 3c c 



^ ) = M __i nz = ___r_ii A r /ff 4 + 0(5 6) (511) 

In the usual case of the operator (|5.2p . one obtains 



Y{gl) = 1 - 0.908.^ + —^L—g 2 + {gt) (5.12) 

for N = 3, which leads to Y(jg%) = 1 - 0.887gg + 0(g$) for N f = 2. 

As an example of physically relevant matrix elements of the topological charge density, we mention the 
one over proton states which enters the so-called proton spin problem [423,488,489,492,532], because it can 
be related to on-shell nucleon matrix elements of the singlet axial current j*j through the anomaly. Indeed, 
the on-shell nucleon matrix element of the topological charge density 

(p,e\q\p',e') = MB(k 2 )u(p,e)i l5 u(p' ,e') (5.13) 

can be related to the on-shell nucleon matrix element of the singlet axial current j^, 

{P,e\jl \p',e') = u{p,e) [G^k 2 )^ - G 2 (k 2 )k^ 5 ] u(p',e'), (5.14) 

where e,e' label the helicity states and k is the momentum transfer. In a naive wave function picture 
Gi(0) can be interpreted as the fraction of the nucleon spin carried by the quarks. Using the axial anomaly 
equation one can easily show that B(0) = G\(0)/Nf. Lattice studies of this issue have been reported in 
Refs. [17,22,224,256,466]. 

5.2 The topological susceptibility and problems from power-divergent additive 
contributions 

Serious problems arise when the quantity that we want to study involves also correlation functions (|5.4p 
with coincident points, i.e. in the limit \xi — xA — > 0. This is for example required to compute the 
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9 dependence of the ground-state energy around 8 — 0. In particular this limit enters the definition of 
topological susceptibility 

X = J d 4 x(q(0)q(x)). (5.15) 

The lattice counterpart (qL{x)qL(y)) does not reconstruct correctly the singular behavior for x — > y, which, 
as we shall see, is essential to determine the physical value of the topological susceptibility, see in particular 
Sec. [SJ The relation of the zero-momentum correlation of two lattice operators Ql(iz;), 

Xz = X>i(aO«L(0)> = i<Qi>, Ql = $>l(z), (5.16) 

X X 

with the continuum x is affected by the presence of an unphysical background term, which becomes dominant 
in the continuum limit. In general, we expect 

X L (go) = a 4 Z L ( g() ) 2 x + B. (5.17) 

Heuristic arguments based on the operator product expansion would lead to [120] 

B(9o) = P(9o)(I) + a 4 ^(5o)(T) + 0(a 5 ). (5.18) 

where I is the identity operator and T is the trace of the energy-momentum tensor. Therefore, the background 
term is a power divergent additive contribution, which eventually becomes dominant in the continuum limit. 
The so-called perturbative tail P(go) can be computed in perturbation theory. For some definitions of q^ 
it has been computed to high order, see e.g. Refs. [28,215,495]. However, the perturbative series is only 
asymptotic, and it is expected to have nonperturbative contributions due to renormalons which cannot be 
disentangled from the 0(a 4 ) physical signal, see e.g. Refs. [74,211,212]. Therefore, perturbation theory does 
not help to disentangle the interesting 0(a 4 ) physical signal in Eq. (|5 . 1 7|) from the renormalization effects. 
The computation of x through Eq. (|5 . 1 T[) requires a well defined nonperturbative subtraction of these terms, 
and therefore a rigorous field-theoretical prescription to unambiguously extract the quantity which has the 
desired continuum limit from the lattice data. In this respect, the main problem may be related to the fact 
that even in the continuum the integrand (q(x)q(O)} in the definition of x is singular for x — > 0, and it is 
not completely clear what prescription provides the correct physical quantity after integration. In Sec. 15.41 
we will report the solution of the problem as outlined in Ref. [387]. The singular behavior of (q(x)q(0)) for 
x — > is discussed in some detail in Sec. [9] 

5.3 Bosonic definitions exploiting various methods 

Geometrical, smoothing (such as cooling and smearing), off-equilibrium (such as heating) techniques have 
been used to address the problems caused by power-divergent additive contributions and multiplicative 
renormalizations in definitions of the topological susceptibility based on discretized versions of the topological 
charge density operator q(x). 

5.3.1 Geometrical method 

The so-called geometrical method [76,382,447] meets the demands that the topological charge on the lattice 
have the classical correct continuum limit and that it take integer values for every lattice configuration in 
a finite volume with periodic boundary conditions. In 4D SU (N) gauge theories this can be achieved by 
performing an interpolation of the lattice field, from which the principal fibre bundle is reconstructed. One 
can unambiguously assign a topological charge to a configuration, provided it is sufficiently smooth and 
satisfies certain bounds [382]; for example, in the case of SU(2) gauge theory, plaquettes must satisfy [382] 

Tr[l - n^] < 0.03. (5.19) 

However, the required constraints are far from the values of the plaquettes in actual simulations, thus leaving 
some ambiguities in the assignment of topological charge to each Monte Carlo configuration. Due to their 
global topological stability, geometrical definitions are not affected by perturbative renormalizations. Since 
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on the lattice each configuration can be continuously deformed into any other, integer valued geometrical 
definitions cannot have an analytical functional dependence on the lattice field. 

Geometrical definitions may be plagued by topological defects on the scale of the lattice spacing, termed 
dislocations [75,383], whose nonphysical contribution may either survive in the continuum limit (as could 
happen in the SU(2) gauge theory with Wilson action [456]), or push the scaling region for the topological 
susceptibility to large values. These warnings have been substantially confirmed by the available results 
for SU(2) and SU(3) gauge theories, and two-dimensional CP 1 ^^ 1 models. For example, the results for 
the SU{2) [282,363,364,456] and the SU(3) [281,282] gauge theories (the latter obtained for < 6.0) 
gave significantly larger values than those of other methods. A better behavior has been observed when 
measuring the geometrical charge on blocked configurations [456] . The problem was reanalyzed in Ref. [458] 
by exploiting the fact that the geometrical definitions can be obtained as limits of sequences of standard 
analytic operators. The results indicate that, with increasing N, the unphysical contribution should get 
suppressed, and that at least at large N the geometrical charge should be free from dislocations. This 
scenario has been checked in two-dimensional CP 1 ^^ 1 models by Monte Carlo simulations [125]. 

Monte Carlo results using this method can be found in Refs. [281,282,363,364,456] for the 4D SU{2) 
and SU{3) lattice gauge theories, and in Refs. [75, 124, 125, 193,338,533,554] for the 2D CP N ~ 1 models. 



5.3.2 Smoothing methods 



In this class of methods the topological properties are read from the configurations generated by Monte Carlo 
simulations after applying appropriate smoothing procedures which eliminate the short-ranged renormaliza- 
tion effects and provide an (approximately) integer value for the topological charge. Various smoothing 
procedures have been proposed and employed in numerical works. 

In the cooling method, see e.g. Refs. [75,328,332,335,512], \ is measured on an ensemble of configurations 
which are "cooled" ; that is, link variables are gradually changed in a way as to minimize the action locally. 
For example, cooling can be achieved by setting = oo in a standard Monte Carlo updating procedure, such 
as Metropolis or heat bath. By virtue of the locality of this procedure, it is ideally expected to eliminate the 
short ranged fluctuations responsible for the renormalization effects, without modifying the global topological 
content of a configuration. The latter may then be extracted from the corresponding cooled configuration, 
finding an (almost) integer value of Ql = ^2 x Ql(x), where qL{x) may be any lattice discretization of the 
topological charge density, such as the one in Eq. (15. 2ft . Different implementations may differ in the speed 
of the minimization procedure and in the way one reads the value of the topological charge Q from the 
cooled configuration. The topological susceptibility is obtained by measuring \l = (Q 2 )/V on the cooled 
configurations, typically when the result becomes stable with respect to the number of cooling steps. 

One possible realization of cooling is by means of a smearing procedure, analogous to that proposed in 
Ref. [15], in which the original links of a configuration are replaced by smeared links, constructed as follows: 



Vf)(z) 



(l-c)K (l 



»(x) 



- X) {x)V^- x \x + u)V^(x + M )t, 



V^{x) 



^V^\x)\V^\x) 



N 



1/2 



(5.20) 



where Vzl(x) — Vi • {% — v)\ and c is a free parameter, which can be tuned to optimize the properties of 
q^(x). This approach lends itself to a Heisenberg picture description [143]: starting from a simple definition 
of qL, such as Eq. (|5.2[) . one constructs sequences of operators by replacing the link variables with the 
smeared link variables. One may consider 



1 



±4 



2%2 ^^P a 



Tr 



n w n (l) 

pv pa 



(5.21) 



where Yi^l is the product of smeared links V^ ! (x) around a 1 x 1 plaquette. All these operators have the 
correct classical continuum limit, i.e. (x) — » a i q(x) for a — > 0. As shown in Ref. [143], the renormalization 
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Figure 2: Histograms of the lattice topological charge as measured by cooling, using the standard twisted 
double plaquette operator in Eq. ()5.2|) . at g$ = 1 (0 = 6) and on a 16 4 lattice for the SU(3) gauge theory, 
after n = 4 (dashed line) and n — 12 (full line) cooling steps. The data clearly tend to cluster around integer 
values as the number of cooling steps is increased. 



effects get suppressed with increasing number of iterations. For example, the multiplicative renormalization 
of the operator with one smearing step is given by Z^g^) = 1 — 0. 247<?o + O(g^) (for the optimal value 
c = 0.6774), as compared to Z^g^) = 1 — 0.908i?q + O(g^) for the operator (|5.2p . However, since the size of 
<j£ (x) increases with the number of iterations i, one must keep a fixed maximum value of i while approaching 
the continuum limit, otherwise the operator cannot be strictly considered as a local operator anymore. 

A delicate point in cooling is to check for eventual losses of topological charge; such losses would not 
occur in the continuum, given the global stability of the topological charge. On the lattice, where instantons 
are only quasi stable, the charge is bound to vanish eventually, after protracted cooling: Given that lattice 
formulations are in general not scale invariant, an instanton can actually decrease its action by shrinking in 
size, and eventually disappear. In this respect, some improvements have been proposed, see e.g. Refs. [112, 
175,262,263,529], by selecting procedures which make lattice instanton-like configurations more stable under 
smoothing. This is achieved, for example, by modifying the action in the minimization procedure such that 
instantons tend to increase under cooling [262]. Ref. [302] proposes another smoothing procedure based on 
hypercubic blocking. Other proposals can be found in Refs. [181,303,329,405,407]. Moreover, topological 
structures have been also investigated using filtering methods constructed by keeping the lowest eigenstates of 
the overlap Dirac operator [324,325] or Laplacian bosonic operators [111], see also Ref. [110] for comparisons 
of the results of these methods. 

Figs. [2] and [3] show typical results obtained using the cooling technique at g$ = 1 (/3 = 6) for the SU(2>) 
gauge theory. The histograms of Fig. [2] show how the values Ql of the topological charge of the cooled 
configurations, as obtained using the lattice operator (|5.2[) . tend to cluster around integer values when 
increasing the number of cooling steps. In Fig. [3] we show the lattice topological susceptibility versus the 
number of cooling steps. We show data for two definitions from the same sample of cooled configurations, 
i.e. results using the lattice operator (|5.2| . and results obtained by taking the integer value closest to Ql as 
estimator of Q, as outlined in Ref. [195]. The latter procedure shows a better convergence, while the first 
one requires a protracted cooling to reach a plateau with respect to the cooling steps. Anyway the difference 
between the results of the two procedures is expected to vanish as 0(a 2 ) in the continuum limit. 

The basis for determining physical topological properties by these smoothing methods is clearly heuristic, 
and their systematic errors are not under robust theoretical control. But, as we shall see, after several 
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Figure 3: The lattice topological susceptibility \l = {Q\)/V versus cooling step at = 1 (/3 = 6) 
for the SU(3) gauge theory, on a lattice of size 16 3 x 36. The data indicated by full circles are obtained 
using the operator in Eq. (|5.2p . The data indicated by crosses have been obtained using the procedure of 
Ref. [195], which reads integer values of Q from the cooled configurations. The latter set of data show better 
convergence; the dotted line indicates the plateau where the value of of xl is taken. 



numerical checks and comparison with other approaches, their results regarding zero-momentum topological 
properties have been seen to be quite reliable. 

Monte Carlo results using methods based on these smoothing procedures can be found in Refs. [110, 
118, 120, 174, 175, 182, 262, 263, 283, 328, 335,402, 403, 452, 456, 504, 512-514, 529] {SU{2) gauge theories), [117, 
144, 174, 222, 300, 303, 316, 332, 425, 501 , 504, 515, 517, 556] (SU(3) gauge theories), [164, 195, 196, 375, 377, 379] 
(SU(N) gauge theories), [29,75,124,125,332] (CP^ 1 / 0(3) models). 

5.3.3 Off-equilibrium methods to determine the renormalization effects 

The renormalization effects in the relation (|5.17|) between the lattice quantity xl an d the topological sus- 
ceptibility x, i.e. the multiplicative renormalization Z and the background contribution B, can be estimated 
by exploiting off-equilibrium Monte Carlo simulations. When using a standard local algorithm, for instance 
Metropolis or heat-bath, quantities like the topological charge, involving changes of global properties of the 
configurations, show a slow dynamics, much slower than the dynamics of quasi-Gaussian modes determining 
other features of the theory, see e.g. Refs. [195,378] and also Sec. [10] The so-called heating method [209] 
relies on the possibility of somehow separating the various contributions to Eq. (|5.17p by means of off- 
equilibrium simulations which exploit the slow dynamic behavior of the topological modes during Monte 
Carlo simulations, and in particular during the thermalization process, giving the opportunity to estimate 
Z and B in Eq. (|5.17j) . Similar considerations were also reported in Ref. [516]. 

The main idea behind this method is to perform Monte Carlo simulations where the lattice modes 
responsible for the additive and multiplicative renormalizations in Eq. (|5.17p are quasi-thermalized, while 
the topological modes are instead frozen and essentially determined by the chosen starting configuration. 
In particular, one can estimate the multiplicative renormalization Z by starting from an instanton-likc 
lattice configuration with Ql ~ 1, measuring how Ql changes under thermalization (i.e. MC steps of 
local algorithms, such as Metropolis or heat-bath) and repeating this procedure many times to eventually 
obtain averages over many different trajectories. If a window exists in which the modes responsible for 
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Figure 4: This figure shows the ratio Ql,u/Ql,o versus n, where Ql,u is the lattice topological charge as 
measured by the operator (|5.2|) after n steps of a local heat-bath algorithm, when heating an instanton- 
like configuration of charge Ql,o ~ 1 and averaged over many independent trajectories, using the Wilson 
action at g$ = 1 (/3 = 6) for the SU(3) gauge theory. The observed plateau provides an estimate of the 
multiplicative renormalization Z. From Ref. [26]. 



the renormalizations are already approximately thcrmalizcd, while the original topological modes remain 
substantially unchanged, then we expect to observe a plateau in the data of Ql with respect to the number 
of heating steps, which provides an estimate of the multiplicative renormalization Z [209]. Fig. [4] shows MC 
data in the case of the operator (|5.2p using the Wilson action at g$ = 1 (/3 = 6) for the SU(3) gauge theory: a 
plateau is clearly observed suggesting Z(g$ = 1) w 0.2. On the other hand, starting from flat configurations 
where Ql = and measuring how the lattice susceptibility xl — Q\jV changes under thermalization, one 
can obtain an approximate estimate of the background term B in Eq. (|5.17p . The data of xl are expected 
to present a plateau with respect to the number of heating steps, which provides an estimate of B. Here one 
is assuming that the dependence of B on the global topological charge Q is negligible. As an example, in 
Fig. [5]we show again results in the case of the operator (|5.2|) using the Wilson action at g$ = 1 for the SU(3) 
gauge theory. Finally, once Z and B are estimated, % can be extracted from Eq. (15.17|) . by supplementing 
the calculation with a standard Monte Carlo simulation at equilibrium to determine xl- As shown in Fig. El 
in the case xl is computed using the operator (|5.2[) . the background B gives a large contribution to xl at 
<7o = 1 already, and this clearly affects the precision of the estimate of x- Better results are obtained by 
considering improved smeared operators [143], whose renormalization effects are smaller. 

Like the cooling method, this method relies on heuristic arguments, and their systematic errors are not 
under robust theoretical control. However, several numerical checks and comparisons with other techniques 
have been reported, showing that good estimates of x are obtained by the heating method. 

Monte Carlo results using this method can be found in Refs. [24,26,27,30,31,33-35,143,199,533] for the 
4D SU(2) and SU(3) lattice gauge theories, and in Refs. [124,209,242,458] for the 2D CP^ 1 models. 



5.4 Fermionic definitions by the index theorem of Ginsparg- Wilson fermions 

Fermionic methods to determine the topological charge density, and its correlations, are essentially based on 
the anomalous flavor-singlet axial Ward-Takahashi identities. The realization of the axial anomaly on the 
lattice has been largely discussed in the literature, see e.g. Refs. [1,2,98,139-141,251,254,287,304,311,347, 
353, 385, 412, 413, 430, 433, 460, 497-500, 527] . 
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Figure 5: This figure shows XL,n versus n, where xl,u is the lattice topological susceptibility as measured 
by the operator (|5.2[) after n steps of a local heat-bath algorithm at g$ = 1 ((3 = 6) for the SU(3) gauge 
theory, starting from a flat configuration and averaged over many independent trajectories. The dot-dashed 
line indicates the equilibrium value of \l (the dotted lines indicate the error). The dashed line shows the 
estimate of the background B obtained by averaging data on the plateau. From Ref. [534]. 



In the continuum theory and for the massless Dirac operator the Atiyah-Singer index theorem [52] states 
that 

^xe^pvTrF^Fpa. = index(L>) = n + - n_ , (5.22) 

where n± are the number of zero modes of the Dirac operator D which are also eigenstates of 75 with 
eigenvalues ±1. Any acceptable discretization of the Dirac operator may be used to determine the topological 
charge for sufficiently small values of a, by determining its almost zero modes. In this respect Dirac operators 
satisfying the GW relation are ideal, because their lattice chiral symmetry allows exact zero modes. As 
discussed in Sec. 14.2.21 Dirac operators satisfing the GW relation preserve an exact chiral symmetry at 
finite lattice spacing, cf. Eq. (|4.15|) . The corresponding Jacobian gives rise to the axial anomaly [384], and 
therefore to a natural definition of the topological charge density [307] 

qi (x) = ~Tr[ l5 D(x,x)], (5.23) 

where D{x,x) represents the kernel of the lattice Dirac operator in position space and the trace is taken over 
the Dirac and color indices. The associated topological charge is equal to the index of the Dirac operator, 
i.e. 

Qi = ^ Qi(x) = n+ — n- (5-24) 

X 

(as above, n± are the number of zero modes with positive and negative chirality). Then, the topological 
susceptibility can be defined as 

X = £te(°)*( s )> = ^- (5.25) 

X 

Analogously, one may define zero-momentum 2n-point correlation functions of qi(x), which are needed for 
the computation of the 9 expansion around 9 = 0. 

The index of the overlap Dirac operator, and in general of all Dirac operators satisfying the GW relation, 
provides a well-defined estimator for the topological charge [278,279,387,432], which can also be used in pure 
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Figure 6: The topological susceptibility from the overlap method at (3 = 5.9, 6.0, 6.1 for the Wilson lattice 
formulation of the 4D SU (3) gauge theory, as a function of the parameter p of the overlap Dirac operator. 
Taken from Ref. [198]. 



gauge theories. Indeed, as shown by Liischer [387], the corresponding zero- momentum 2n-point correlation 
functions, and in particular the topological susceptibility, can be expressed in terms of appropriate zero- 
momentum correlation functions of scalar and pseudoscalar quark operators which are free of short-range 
singularities, such as 

X = mi...m s a 4s ~ 4 ^ {P r i(x 1 )Si2{x 2 )--S r -i. r (x r ) P a<r+1 (x r+1 )S r+ i tr+ 2{x r+ 2)...S s -i tS (0)) c (5.26) 




where a, 6 are flavor indices. Since the normalization of the correlation functions is fixed by the nonsinglet 
chiral Ward identities, these expressions provide an unambiguous definition of the topological susceptibility, 
and in general of the zero-momentum 2n-point correlation functions. By a further refining of this argu- 
ment [387], these results can be shown to also hold in the case of the pure gauge theory. Therefore, also in 
pure SU (N) gauge theories the index of the overlap Dirac operator provides a well defined fermionic defi- 
nition of topological charge and the moments of its distribution at 6 — 0, which determine the coefficients 
of the 9 expansion. This method circumvents the problem of renormalization arising in bosonic approaches, 
albeit at a much higher computational cost than bosonic methods. 

We also remark that in the case of the overlap Dirac operator, an explicit ambiguity arises from the fact 
that when one varies the parameter p, cf. Eq. (14.27|) . within its theoretically acceptable range, an eigenvalue 
of the operator X = Dw — {p/ a ) may occasionally change sign, thus changing the value of the operator 
index [198,231]. This gives rise to a corresponding ambiguity in the value of the topological charge to be 
assigned to the underlying gauge field. However, the effects of this ambiguity are expected to disappear in 
the continuum limit, and therefore lattice topological susceptibilities defined using different values of p are 
expected to only differ by 0(a 2 ). The dependence of p has been numerically studied in Ref. [198]. Some 




Xl,...X s _l 



where 1 < r < s < Nf, and 




(5.27) 
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results for \ are shown in Fig. [6] as a function of p, where one can clearly observe that the dependence of p 
gets reduced with increasing /3, thus approaching the continuum limit. 

We should also mention the warning of Refs. [156, 159] on the possibility of a residual nonperturbative 
ambiguity in defining the topological susceptibility in pure gauge theories. 

More detailed discussions of the fermionic methods and Monte Carlo results can be found in Refs. [223, 
228,232,497] (using ultralocal fermion actions), and Refs. [1,90,131,139,142,178,185,191,198,223,225,227, 
231, 233, 253, 271, 274-279, 306, 307, 412, 418, 428, 432, 433, 508, 556] (using Ginsparg- Wilson fermions). 

Wc finally mention a recent proposal [43] for the calculation of the topological susceptibility, by computing 
correlation functions at fixed topological sector. Expectation values of observables at fixed global topological 
charge Q differ by 0(1/V) terms (V is the space-time volume) from those at 6 — 0, obtained by sampling 
the topological sectors. Under some assumptions, these differences can be determined by a saddle-point 
expansion [108]. In particular, the large-distance behavior of the two-point correlation function of the 
topological charge density q(x) at fixed topology Q is given by [43] 

<g(0)g(aO><a = -£ + ^ («* + + °(^ 3 ) + 0(e-v'*') (5.28) 

with %4 defined in Eq. (|3.8p . Rcf. [43] proposes to extract \ from the above formula, or from an analogous 
behavior of the correlation function of flavor singlet pseudo-scalar densities, by performing MC simulations at 
fixed topological sector, using overlap fermions and a modified lattice action which suppresses zero eigenval- 
ues, and therefore prevents any change of the topological charge. Results for two-flavor QCD are presented 
in Ref. [42]. 



5.5 The U(1)a problem on the lattice with Ginsparg- Wilson fermions 

The large- N solution of the U(1)a problem and the Witten-Veneziano formula for the rf mass has also been 
investigated on the lattice using Ginsparg- Wilson fermions. We report the main steps of its derivation as 
outlined in Ref. [279], which is based on the lattice topological charge density q%(x) = ^Tr [j 5 D(x, x)] defined 
from a Dirac operator satisfying the GW relation (I4.14|) . such as the Neuberger overlap Dirac operator. For 
more details see the original reference [279]. 

In the presence of Nf massless flavors, using the results of Sec. 14.2.21 and in particular the symmetry 
(|4.15p . one can write the lattice anomalous flavor-singlet Ward-Takahashi indentity 

V M (A M P) = 2N f ( qi (x)P) + (SP), (5.29) 

where is the singlet axial current, P is a generic product of local operators, and SP the corresponding 
variation under the symmetry (|4.15p . The operators qi(x) and the divergence A^(x) are not RG invariant 
operators, as already discussed in Sec. 15.11 for their continuum counterparts. They can be renormalized by 
analogy to the continuum case, leading to renormalized operators q( r \x) and A^\ 

q r (x) = qi(x) - r^r-V^A^a;), A^ r = (1 - z)A ll (5.30) 
2N } 

where z diverges logarithmically at two-loop order, and vanishes when u = Nf/N — * 0, cf. Eq. (|5.8[) . This 
allows us to write a renormalized Ward-Takahashi identity analogous to (I5.29|) . 

A formula for the rf mass in the limit u — ► can be then obtained by assuming that in this limit the rf 
mass vanishes as 0{u). For vanishing quark mass, one obtains 

hm p ^ lim u ^ J d 4 xe- ipx V^(A^ r q r (0)) = ^-m\,\ u=Q (5.31) 

because the integral is determined by the rf pole. Here, /,,< = f n has been used, and / w is defined as in 
Sec. 12.31 cf. Eqs. (|2.13H2.15)) . On the other hand, using the renormalized Ward-Takahashi identity analogous 
to Eq. (|5.29p . one also arrives at 

linip^o lim^o ^7 / d A xe^ lpx \7 ^{A^ r q r {Q)) = lim p ^o lim„^ / d A xe lpx {q l {x)q l {Q)) . (5.32) 
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Figure 7: The scaling ratio C = x/® 2 versus a 2 a, where a is the string tension and a 2 a is the expected order 
of the scaling corrections, for SU(3), SU(4) and SU(6) gauge theories, obtained using a cooling technique. 
From Ref. [195]. 



Finally, Eqs. ([BUT]) and lead to 



which provides a lattice version of the Witten-Veneziano formula. 

6 Results from lattice Monte Carlo simulations 

The dependence of 4D SU (N) gauge theories has been investigated by Monte Carlo simulations of their 
lattice formulation, as described in Sec. 14.11 The lattice action corresponding to the Lagrangian (jl.ip cannot 
be directly simulated for ^ 0, by virtue of the complex nature of the 8 term. On the other hand, the 
coefficients in the expansion of the scaling ground-state energy f(9), cf. Eq. (|3.ip , around 6 = 0, i.e. C and 
&2i, can be accessed by determining the moments of the topological charge distribution at 9 = 0. They are 
dimcnsionless rcnormalization-group invariant quantities, which approach a constant in the continuum limit, 
with 0{a 2 ) scaling corrections. 

6.1 The topological susceptibility in the 4D SU(3) gauge theory- 
Many studies have been devoted to the determination of the topological susceptibility of the 4D SU (3) pure 
gauge theory. Geometrical, smoothing and off-equilibrium techniques have been used to address the problems 
caused by power-divergent additive contributions and multiplicative renormalizations in definitions of the 
topological susceptibility based on bosonic discretized versions of the topological charge density operator q{x). 
As discussed in the preceding section, these methods have their drawbacks, since their systematic errors are 
not under robust theoretical control. Substantial progress has been achieved after the introduction of a 
fermionic definition through the index of the overlap Dirac operator [427], which provides a well-defined 
estimator for the topological charge [387], even in the case of pure gauge theories. This method circumvents 
the problem of renormalization arising in bosonic approaches, albeit at a much higher computational cost. 

In Table [1] we report several results for the topological susceptibility as obtained by various methods. 
We list the scaling quantities C = xl® 2 an d X r o (where ro is the length scale defined in [503], which can 
be related to the string tension a through the estimate [434] ct 1 / 2 ^ = 1.193(10)), and the value x 1 ^ 4 m 
MeV as reported in the corresponding paper. In order to check consistency among the various results, the 
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Table 1: MC results for the topological susceptibility of the 4D SU(3) pure gauge theory. We report the 
type of lattice action (Wilson action [545] (WA), improved action introduced in Ref. [333] (IA), tadpole 
improved Liischer-Weisz action [19,165,389] (TILW), one-loop Symanzik improved (11SI), classically perfect 
action [434] (FP), doubly blocked Wilson action [511] (DBW2)), the method employed to determine the 
topological susceptibility, the scaling quantities C = x/c 2 and x r o (where r is the length scale defined 
in [503]), and the value of x 1 ^ 4 in MeV, which is often derived by using the typical value ^fa — 440 MeV 
(some authors prefer y/a — 420 MeV) or r = 0.5 fm. The data marked by an asterisk have been derived by 
us (the original references do not report them), using also results for a and ro in the literature, such as the 
estimate [434] a 1 ' 2 ^ = 1.193(10) (thus a 2 r% = 2.03(7)), and values for era 2 at various /3-values [375]. 



Ref. 


year 


Lattice action 


Method 


C = Xl° 2 


4 


X 1/4 (McV) 


[222] 


2007 


WA (3 


= [5.9,6.3] 


HYP smearing 


0.0259(10)* 


0.0524(13) 


193(9) 


[90] 


2006 


WA 13 


= 5.85 


overlap HF 


0.035(2)* 


0.071(3) 


190(3)* 


[85] 


2006 


DBW2 (3 = 0.87 


APE smearing 


0.028(2)* 


0.056(4) 


179(4)* 


[191] 


2005 


WA (3 


= [5.8,6.2] 


overlap 


0.029(2)* 


0.059(3) 


191(5) 


[33] 


2005 


WA (3 


= 6.0 


heating 


0.0263(8)* 


0.053(3)* 


173.4(1.7)± i 


[187] 


2005 


TILW 


13 = 7.2 


overlap 


0.027(4)* 
0.025± 2 


0.055(7) 


191 


[198] 


2004 


WA (3 


= [5.9,6.1] 


overlap 


0.055(10) 


188(17) 


[276] 


2003 


WA (3 


= [5.84,6.14] 


overlap 


0.029(3)* 


0.059(5) 


195(4)* 


[83] 


2003 


11SI 13 


= 8.0, 8.4 


Boulder/HYP 


0.030(3)* 


0.061(5) 


183(4)* 


[360] 


2003 


WA (3 


= 5.94 


APE smearing 


0.030(2)* 


0.061(3) 


183(3)* 


[360] 


2003 


WA (3 


= 5.94 


overlap 


0.031(2)* 


0.063(3) 


185(3)* 


[142] 


2003 


WA 13 


= 6.0 


overlap 


0.026(3)* 


0.052(6)* 


175(6) 


[164] 


2002 


WA (3 


= 5.9 


overlap 


0.0211(53) 


0.043(11)* 


168(11)* 


[195] 


2002 


WA (3 


= [5.9,6.2] 


cooling 


0.0282(12) 


0.057(3)* 


180(2) 


[271] 


2002 


TILW 


13 = [8.1,8.6] 


overlap 






191(5) 


[306] 


2002 


FP (3 - 


= [3.0,3.2] 


overlap 


0.030(4)* 


0.0612(75) 


196(6) 


[185] 


2002 


WA (3 


= 5.9 


overlap 


0.043(6)* 


0.087(13)* 


213(7) 


[375] 


2001 


WA 13 


-[5.7,6.2] 


cooling 


0.035(3) 


0.072(7)* 


191(4)* 


[21] 


2001 


IA (3 = 


= [1.8,2.1] 


cooling 


0.0333(27) 


0.0570(43) 


107+13 

iy ' -16 


[300] 


2000 


WA (3 


= 6.0 


APE smearing 






193(4) 


[67] 


2000 


WA (3 


= 5.7 


fermionic 


0.0263(24)* 


0.053(5)* 


188(5) 


[230] 


1999 


WA (3 


= [5.7,6.0] 


overlap 


0.038(8)* 


0.077(16)* 


194(10) 


[303] 


1998 


WA 13 


- [5.85,6.1] 


APE smearing 


0.036(4) 


0.074(9)* 


203(5) 


[174] 


1998 


WA 13 


= [5.85,6.0] 


improved cooling 






183(10) 


[501] 


1998 


WA (3 


= [6.0,6.4] 


cooling 


0.032(9) 


0.065(19)* 


187(22) 


[30] 


1997 


WA 13 


= [5.9,6.1] 


heating 


0.028(4)* 


0.057(8)* 


170(7) 


[534] 


1995 


WA (3 


= 6.0 


heating 


0.04(1) 


0.08(2)* 


190(12) 


[282] 


1988 


WA 13 


= 6.0 


geometrical 


0.047(5)* 


0.095(11)* 


205(5)* 


[515] 


1988 


WA f3 


= [5.6,6.2] 


cooling 


0.033(3) 


0.067(7)* 


179(4) 
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Figure 8: Extrapolation to the continuum limit of the scaling quantity \ r o {N — 3). Data obtained by the 
overlap method for two different values of the arbitrary parameter p entering the definition of the overlap 
definition. From Ref. [191]. 
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Figure 9: The scaling ratio C = x/ " 2 versus a 2 a for the SU (3) gauge theory, obtained by using the heating 
method to determine the lattice renormalizations in Eq. (|5.17l) (the data for % at /3 = 5.9, 6.0, 6.1 are taken 
from Ref. [30] and correspond to the 2-smeared operator, while the data for the string tension at the same 
values of (3 come from Ref. [375]). They suggest the continuum extrapolation C — 0.028(4) (a linear fit to 
C + ba would give C = 0.032(5)). 
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comparison of the scaling quantities C and x r o ^ s more significant than the value of x 1 / 4 in MeV, which 
may depend on the choice of the scale (typical values used for the string tension are ^fa = 420-440 MeV, 
and ro = 0.5 fm for the Sommer scale, but other choices have been also used in some cases). In the most 
accurate numerical works the continuum limit of the scaling quantities x/c 2 or X r o ^ s obtained by taking 
into account the expected 0(a 2 ) corrections. These scaling corrections are clearly exemplified in Figs. [7] and 
[H where some MC results for the scaling ratio x/ ' 2 are plotted versus a 2 a (data from Ref. [195], using 
the cooling method), and for the scaling quantity x r t versus a 2 /r 2 , (data from Ref. [191], using the overlap 
method). 

The recent studies based on the overlap definition, see Table [TJ have led to a quite precise estimate 
of the topological susceptibility of the pure SU(3) gauge theory. The most accurate estimate using this 
definition is [191] x r o — 0.059(3), corresponding to C = 0.029(2). It is important to note that the results 
obtained by the (less computer-power demanding) bosonic methods are substantially consistent, showing 
their effectiveness although they are supported by a weaker theoretical ground. For example, we mention 
the results: C = 0.0282(12) obtained using cooling [195], and C = 0.0259(10) [222] using the HYP smoothing 
method [302]. Fig. [5] shows results for the scaling ratio C = x/ " 2 as obtained from the heating method 
(the data for x at /3 = 5.9, 6.0, 6.1 are taken from Ref. [30] and correspond to the 2-smeared operator, 
while the data for the string tension at the same values of (3 come from Ref. [375]); they suggest the 
continuum extrapolation C = 0.028(4). We also mention that comparisons of determinations of x by cooling 
and fermionic overlap methods on the same sample of configurations have been done in Refs. [164,556], 
observing a strong correlation between them at the typical values of p where MC simulations are actually 
performed. Ref. [90] used an overlap hypercubic fermion to determine x, which was a variant of the standard 
overlap operator, replacing the Wilson operator D\y in Eq. (|4.28|) with another hypercubic kernel which 
improves the locality properties of the overlap Dirac operator. 

The values of x 1 ^ 4 can be compared with the Witten-Veneziano formulae, cf. Sec. 12.31 The r.h.s. of 
Eq. (|2.13p . gives x 1 ^ 4 ~ 190 MeV using the actual values of /„■, m^, and Nf = 3; using the formula refined 
by Veneziano [531], cf. Eq. (|2.15p . for which m 2 , — ► m 2 , + m 2 — 2m 2 K , one obtains x 1/4 « 180 MeV. The 
agreement of the results reported in Table [T] is remarkable. The results obtained in recent years suggest the 
estimates 

C = xlo 2 = 0.028(2), xA = 0.057(5), (6.1) 

where the error is intended to take into account the spread and the typical uncertainty of the estimates of 
X- Then, using the quite standard values \fa — 440 MeV and ro = 0.5 fm, we obtain respectively: 

X 1/4 = 180(3) MeV, x 1/4 = 193(4) MeV. (6.2) 

The two estimates are in reasonable agreement with the Witten and Veneziano formulae, cf. Eqs. (|2.13| and 
(|2.15p ; the difference between the estimates is of course related to the uncertainty in the physical scale for a 
and for ro. 

We also mention that the topological susceptibility of a pure SU(3) gauge theory has been also studied 
by QCD spectral sum rule methods [419-421], leading to the estimate x 1 ^ 4 ~ 120 MeV, and by the field- 
correlator method [138], obtaining x 1 ^ 4 = 196(7) MeV. 

Results for the 4D SU{2) gauge theory can be found in Refs. [27,31,86,87,120,175,181,182,184,208,229, 
230, 263, 288, 363, 364, 375, 379, 403, 418, 456, 517, 553] . 

Topological structures, such as instanton-like configurations, have been investigated on the lattice in 
Refs. [104, 110, 111, 144, 164, 178, 179, 182, 183, 225, 229, 244, 262, 268, 300, 303, 320, 324-327, 332, 379, 402, 405, 
504,529]. In particular, Refs. [104,110,111,179,225,228,320,324-327] investigated topological structures 
beyond instanton-like configurations. Some reviews of these results can be found in Refs. [109,520]. Issues 
concerning the relation of the topological structures to the chiral structure of the lowest fermionic eigenstates 
have been discussed in Refs. [96, 109, 164, 178, 179, 223, 225, 268-270, 288, 315, 323, 326, 329] . 

6.2 The topological susceptibility for N > 3 and in the large-iV limit 

The large- TV limit of 4D SU (N) gauge theories can be investigated numerically, by performing Monte Carlo 
simulations of their lattice formulation, for various values of N, and checking the convergence to N = oo 
with the expected l/N 2 approach. This idea has been recently exploited in several papers, see Refs. [106, 
107, 190, 192, 194-197, 375, 377-380, 401, 414-416, 518] . 
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Figure 10: Comparison between overlap and cooling determinations of topological quantities on the same 
sample of configurations, for N — 2,3,4,5, at the same value a^fa ~ 0.261 (corresponding to j3 = 5.9 
at N = 3 using the Wilson action). In particular, the figure shows data for the quantities (Q 2 ) — (Qf) 2 
(circles), (Q 2 ) — (Q g ) 2 (squares), and ((Qf — Q g ) 2 } (diamonds) where Qf and Q g are the topological charges 
as determined by the overlap and cooling methods respectively. From Ref. [164]. 



The topological properties of 4D SU (N) gauge theories in the large- TV limit have also been examined using 
this approach in several numerical investigations, providing interesting checks of the predictions obtained 
using large- N arguments, discussed in Sec. 13.21 The main limitation of these numerical studies is essentially 
related to the fact that the correct sampling of the topological charge becomes more and more difficult at 
large N, due to a peculiar phenomenon of critical slowing down, which affects the topological modes and 
which significantly worsens with increasing N [195,379]. We shall return to this point in Sec. [TO] 

Results for N > 3 can be found in Rcfs. [164,195,375,379], up to N = 8. They are listed in Tabled They 
were essentially obtained by the cooling method. The only exception is Ref. [164] which presents a comparison 
between measurements using the cooling and overlap methods on the same sample of configurations, up to 
N = 5. Some results are shown in Fig. I10[ which presents various averages involving lattice topological 
charges Qf and Q g obtained using the overlap and cooling methods respectively, from MC simulations 
at fixed ay/a — 0.261 and for various values of N, i.e. N = 2,3,4,5. Although the sample size in this 
comparison is relatively small, it clearly shows that there is a strong correlation between the cooling and 
overlap determinations, which increases with increasing N. 

MC data for N — 3,4, 5, 6, 8 are shown in Fig. [TTJ They fit well the expected large- N behavior: 

C = C x + ||, (6.3) 
thus providing an estimate of Coo, and therefore of the topological susceptibility in the large- N limit: 

Coo = 0.0221(14), 

Coo = 0.0200(43), (6.4) 
Coo = 0.0248(18), 

respectively from Ref. [195], Ref. [375], and Ref. [379]; the latter was obtained using N < 8 and keeping 
a = 1/5T C fixed, where T c is the critical temperature at the deconfinement transition. The coefficient C2 in 
Eq. (|6.3[) turns out to be quite small, [195] 

c 2 ~ 0.06. (6.5) 
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Table 2: MC results for the topological susceptibility of the 4D SU(N) gauge theory with N > 3. All of 
them have been obtained using the Wilson action and the cooling method, with the only exception of the 
results of Ref. [164] which were obtained using the overlap definition. Only the results of Refs. [195,375] were 
obtained by extrapolating the data to the continuum limit. The data shown from Ref. [379] were obtained 
using values of Q rounded to the appropriate neighboring integer value; they correspond to a fixed value of 
the lattice spacing a = 1/5T C . Results from Ref. [164] were taken at one value of (3 only. 



N Ref. C = x/<y' z 

4 [375] 0.0224(39) 
[195] 0.0257(10) 
[164] 0.0213(47) 
[379] 0.0301(10) 

5 [375] 0.0224(49) 
[164] 0.0234(51) 

6 [195] 0.0236(10) 
[379] 0.0265(14) 

8 [379] 0.0236(31) 
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Figure 11: Results for C = x/v 2 versus 1/N 2 , for N = 3, 4, 5, 6, 8. We show MC data taken from Refs. [195] 
(filled circle, by cooling), [375] (square, by cooling), [379] (diamonds, by cooling), [164] (left triangle, by 
overlap), [191] (cross, by overlap), [30] (triangle, by heating), [222] (open circle, by HYP smoothing). The 
result on the y-axis, indicated by an asterisk, shows the extrapolations to N = oo obtained in Ref. [195]. 
Some data have been slightly shifted along the x-axis to make them more visible. 
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Table 3: Results for the coefficient 62 of the 0(# 4 ) term in the expansion f)3.5[) of the ground-state energy 
around 6 — 0. 



N Ref. 



method 



3 [195] 
[199] 
[277] 



cooling 
heating 
overlap 
cooling 
cooling 



0.023(7) 
0.024(6) 
0.025(9) 
0.013(7) 
0.01(2) 



4 [195] 
6 [195] 



We stress that the good agreement for N = 3 of the cooling method with the more rigorous overlap result 
make us quite confident on the reliability of cooling results for higher values of N, since there are no 
arguments to suggest that this agreement could be spoiled with increasing N; actually, there are arguments 
and evidence in favor of improved agreement [164,458], as shown by the data presented in Fig. [10] 

These results are in substantial agreement with the large- N relation (|3. 19|) . In order to compare them 
to the Witten-Veneziano formulae, one may translate them in physical units assuming the standard value 



= 440 MeV, thus obtaining x£^ = 170(3) MeV from Ref. [195], x^ 4 = 165(9) MeV from Ref. [375], and 
X U 4 = 175(3) MeV from Ref. [379]. 



We finally mention some interesting comparisons of instanton size distributions for various values of N 
which have been presented in Refs. [164,379]. They provide significant evidence for the suppression of small 
instantons with increasing N. Fig. [12] shows the distribution of instanton size as obtained after a number 
of cooling steps at a = 1/8T C fixed and for various values of N, up to N = 8. Besides the suppression of 
small instantons, one also observes a rapid narrowing of the distribution as N grows. The data suggest that 
at large N the distribution is peaked around a scale p ~ 1/T C , where T c is the critical temperature of the 
deconfinement transition [164]. The suppression of small topological structures can be also explained by 
semiclassical instanton calculations, which shows that the instanton size distribution is suppressed for small 
size p as [285] 



suggesting that small instantons are heavily suppressed with increasing N . Instanton-size distributions 
qualitatively similar to those shown in Fig.[T2]have been also obtained within the instanton liquid model [469] . 

6.3 The 0(6 4 ) term in the expansion of the ground-state energy 

Higher moments of the topological charge distribution provide estimates of the coefficients &2n in the ex- 
pansion of the scaling energy density f(0), cf. Eqs. (|3.2p and (|3.5p . In particular 62 can be estimated using 
formulae (|3.7p . (|3.8p . Note that a reasonably accurate computation of Xi m a MC simulation, which is 
necessary to compute 62, requires very high statistics due to a large cancellation between the two terms in 
its definition (|3.8[) . Of course, things worsen for the determination of higher-order coefficients &2n- 

There are a number of results for 62 at N = 3, obtained by different approaches: Ref. [195] used the cooling 
method, Ref. [199] used the heating technique to estimate additive and multiplicative renormalizations in 
zero-momentum correlations of lattice discretizations of q(x), and finally Ref. [277] used the overlap method. 
The results reported in Tableware in good agreement, suggesting that the systematic errors of the various 
methods are sufficiently small. The fourth moment of the topological charge distribution was investigated 
also in Ref. [222], obtaining substantially consistent results, although the authors left open the possibility 
that the ratio Xi/Xi an d therefore 62, may vanish in the large-volume limit. 

The results of Table [3] provide robust evidence that 62 is nonzero, and therefore that there are deviations 
from a Gaussian distribution of the topological charge. However, 62 turns out to be quite small, indeed 
[62! ^ 1- Thus deviations from a simple Gaussian behavior are already small at TV = 3. 




(6.6) 
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There are also estimates for larger values of A, see Tabled but only using the cooling method. Again, 
given the agreement found at N = 3, higher A results should be sufficiently reliable. They appear to decrease 
consistently with the expectation from the large- A scaling arguments, i.e. 

b 2 m with b 2 pa -0.2. (6.7) 

Fig. [TBI shows a plot of the available results for b 2 , actually N 2 b 2 , versus A. 

Overall, these results support the scenario obtained by general large- N scaling arguments, which indicate 
9 = 9/N as the relevant Lagrangian parameter in the large- A expansion. They also show that A = 3 is 
already in the regime of the large- A behavior. For A > 3 the simple quadratic form 

F (6) - F(0) a 1 - X 9 2 (6.8) 

provides a good approximation of the dependence on 9 for a relatively large range of values of 9 around 
6» = 0. 



6.4 9 dependence at finite temperature and across the phase transition 

Another interesting issue concerns the behavior of topological properties at finite temperature, and in partic- 
ular their change at the finite-temperature deconfining transition, which is first order for A > 3, and second 
order for N — 2; see e.g. Refs. [376,377,380] and references therein. 

At high temperature, T>T C where T c is the transition temperature, one can compute the 9 dependence 
semiclassically. At zero temperature semiclassical calculations fail because of the absence of any large- 
distance cutoff on the instanton length scale [116]. However, at finite temperature, the temperature T, which 
is related to the size of the Euclidean time dimension, is expected to act as a natural infrared cutoff [285]. 
Consequently, at sufficiently high temperature, say T ^> T c , one may compute the 9 dependence from the 
one-loop contribution of instantons to the functional integral, obtaining [285] 

F{9) - F(0) - (1 - cos6»)T 4 exp[-8TT 2 /g 2 (T)], (6.9) 

where g(T) is the running coupling constant at the scale T. In the case of a pure gauge theory, 

8^ 2 11 



9 2 {T) 3 



■JVlnT (6.10) 



asymptotically at large T. This shows that the 9 dependence gets suppressed at high temperature. 

The finite-temperature behavior of the topological susceptibility, and in particular its behavior across 
the transition, has been investigated in several numerical MC works, see Refs. [30,101,174,196,206,226,248, 
270-272,379,515], using different methods to determine the topological susceptibility. 

Results for the SU(3) gauge theories can be found in Refs. [30,226,271,379]. They have been obtained by 
the heating method [30], the cooling method [379], and the overlap method [226,271], and give substantially 
consistent results. They show that the topological properties, and in particular the topological susceptibility 
X, vary very little up to T < T c . They change across the transition, where x shows a significant decrease. 
For example [271], x decreases by approximately a factor 13 as the temperature is increased from 0.88 T c to 
1.31 T c . Then, at high temperature, T ^> T c , where the instanton calculus (|6.9p should become reliable [285], 

X ~T 4 exp[-8^ 2 /3 2 (r)]. (6.11) 

Concerning the large-A behavior (investigated by performing simulations at various values of N > 3 
[196,379]), the results indicate that x has a nonvanishing large- A limit for T < T c , as at T — 0, and that 
the topological properties, and therefore the ground state energy F(9), remain substantially unchanged in 
the low-temperature phase, up to T c . On the other hand, above the deconfinement phase transition, T > T c , 
X shows a large suppression, hinting at a vanishing large- A limit for T > T c . 

Fig. [14] shows results for the scaling ratio 

R(T) . ^ (6.12) 
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Figure 12: The instanton size density, D(p), for N = 2(*), 3(+), 4(o), 8(») on 16 4 lattices with a ~ 1/8T C . 
From Ref. [379]. 



0.4 



0.0 
N 2 b 2 

-0.4 




-1.2- 



iV 



Figure 13: Plot of 7V 2 6 2 versus iV, for N = 3,4,6. We show MC data taken from Ref. [195] (circles, by 
cooling), from Ref. [199] (square, by heating), from Ref. [277] (diamond, by overlap). Some data for N = 3 
have been slightly shifted along the x-axis to make them more visible. 
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Figure 14: The ratio R{T) = X (T)/x{T = 0) as a function of the reduced temperature t = T/T c - 1. For 
each SU(N) gauge theory, these curves are expected to converge to a continuum curve when L t — > oo, where 
L t is the length, in units of the lattice spacing, of the lattice along the Euclidean time direction. From 
Ref. [196]. 



versus the reduced temperature t = T/T c — 1 for 4D SU(N) gauge theories with N = 4, 6 [196], and 
Euclidean temporal size L t = 6,8 (we recall that the temperature is related to L t by 1/T = aL t ). One can 
immediately observe that its behavior is drastically different in the low- and high-temperature phases. In 
the low-temperature phase, all data for N = 4, L t = 6, 8 and N = 6, L t = 6 appear to lie on the same curve, 
showing that scaling corrections are small and also that the large- N limit is quickly approached. The ratio 
R remains constant and compatible with the value R — 1. As shown in Ref. [379], also the instanton size 
distribution appears substantially unchanged for T < T c , Only close to T c , i.e. for T > 0.97 T c , does this 
ratio appear to decrease. These results show that in the confined phase the topological properties remain 
substantially unchanged up to T c . On the other hand, above the deconfinement phase transition, \ shows a 
significant decrease. The comparison between the N = 4 and N = 6 data shows that the ratio R decreases 
much faster for N = 6, hinting at a vanishing large- N limit of R for T > T c . A comparison with the results 
for N — 3 of Refs. [30,271] suggests that the suppression of topological fluctuations is faster in ST/ (4) than 
it is in SU(3). 

Numerical results supporting the same picture have also been reported in Ref. [377] . The numerical evi- 
dence of the topological suppression across the transition was inferred from simulations at T c , by monitoring 
the correlation of the topological charge with the Polyakov line, whose value is used to infer the actual phase 
of the configurations generated along the given Monte Carlo run for N > 3 at the first-order transition. This 
has led to estimates of the topological susceptibility in the deconfined and confined phase at T c , whose ratio 
is shown in Fig. [15] for various N. 

Ref. [196] presented also some results for the finite-temperature behavior of the coefficient 62 of the 6 
expansion around 8 = 0: it remains small and substantially unchanged up to T ~ T c , then it appears to 
slightly increase at T >T C . 

Summarizing, the lattice results suggest that physical properties determined by topological effects remain 
substantially unchanged in the low-temperature confined phase. On the other hand, in the high-temperature 
phase there is a sharp change of regime where the topological susceptibility is largely suppressed. Such 
suppression becomes larger with increasing N, suggesting that the topological susceptibility vanishes above 
the critical temperature. Monte Carlo results seem to support the scenario presented in [351]: at large N 
the topological properties in the high-temperature phase are essentially determined by instantons, from very 
high temperature down to T c ; the exponential suppression of instantons, as e~ N , induces the rapid decrease 
of the topological activity observed in the large- N limit. 
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Figure 15: The ratio of the topological susceptibility in the deconfined and confined phases at T c . Data 
from Ref. [377]. 



6.5 Results for the 9 dependence of the spectrum 

Another interesting issue concerns the 9 dependence of the spectrum of the theory. A numerical MC study 
for 4D SU{N) gauge theories was reported in Ref. [192]. Again numerical simulations of the Wilson lattice 
formulation were employed to investigate the 9 dependence of the string tension a(9) and the lowest glueball 
mass M(9). They can be expanded around 9 = as in Eqs. (|3~TTj) and (f37T2|) . Then the coefficients Sj 
and gi of these expansions can be computed from appropriate correlators at 9 = 0. In particular, s 2 can 
be determined [192] from the large-i behavior of connected correlation functions of two Polyakov lines at 
distance t and the square topological charge, such as 



where 



(Ap(t)Q 2 ) e=0 - (A P (t))g = o(Q 2 )e=o 



A P {t) = J2 TrPt(0;0) Tr P(x 1 ,x 2 - 1 t), 



.13) 



(6.14) 



P(xi, x 2 ; t) is the Polyakov line of size L along the x% direction, and Q is the topological charge. Analogously, 
the 0{9 2 ) term of the glueball mass can be obtained from appropriate connected correlation functions of 
plaquette operators and Q 2 . The 0(9 2 ) coefficients s 2 and g 2 , and in general all coefficients of those 9 
expansions, are dimensionless scaling quantities, which should approach a constant in the continuum limit, 
with 0(a 2 ) scaling corrections. 

Ref. [192] obtained the first estimates of s 2 and g 2 , cf. Eqs. (|3. 1 1|) and (|3.12|) . using the cooling method 
to determine the topological charge, and for N — 3,4, 6 to also check their large- N behavior. The 0{9 2 ) 
terms in the expansion around 9 = of the spectrum of SU{N) gauge theories are small for all N > 3. For 
example we mention the estimates 



S2 



-0.08(1), g 2 = -0.06(2) for N 



(6.15) 



The 9 dependence appears even smaller when dimensionless combinations are considered, such as M / y/a 
and, for N > 3, the ratio of tensions corresponding to strings in different representations. For example, in 
the case of the scaling ratio 



M(9) 



^(l + r 2 9 2 + ...), 



(6.16) 



we find r 2 = g 2 - s 2 /2, thus r 2 = -0.02(2) for N = 3. 
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The 0(9 2 ) corrections appear to decrease with increasing N, and the coefficients do not show evidence of 
convergence to a nonzero value. This is suggestive of a scenario in which the dependence of the spectrum 
disappears in the large- N limit, at least for sufficiently small values of 9 around 9 = 0. In the case of 
the spectrum, the general large-TV scaling arguments of Sec. 13.21 which indicate 9 = 6/N as the relevant 
Lagrangian parameter in the large- iV limit, imply that 0(9 2 ) coefficients should decrease as 1/N 2 . The 
results of Ref. [192] appear substantially consistent: In the case of the string tension they suggest 

ct(0) = a (1 + s 2 9 2 + ...) , s 2 «-0.9/iV 2 . (6.17) 

Further investigation would be useful to put this scenario on a firmer ground, using for example other 
definitions of topological charge. 



7 Results in full QCD 

In this section we discuss a number of issues related to the 9 dependence and U(1)a symmetry breaking in 
QCD. We recall that in the presence of Nf fermions, the general QCD Lagrangian with a 9 term reads 

£ e = \f« v {x)F« v (x) +^^ + ^Mfe + feMt Vl - *9 q q(x), (7.1) 

where tp is an Nf component vector in flavor space, and M is the quark mass matrix. As already discussed 
in Sec. 12.11 the parameter 9 q and the imaginary part of the quark mass are related, so that the relevant 9 
parameter is given by 

= 9 q + argdetM. (7.2) 
7.1 The neutron electric dipole moment 

In strong interactions, one of the most stringent constraints on possible violations of parity and time reversal 
symmetries is inferred from measurements of the neutron electric dipole moment (NEDM). The experimental 
analyses of Refs. [293] and [61] lead to the upper bounds 

\d n \ < 6.3 x l(r 26 ecm, \d n \ < 2.9 x 1(T 26 e cm, (7.3) 

respectively. In the standard model, CP violation in the Cabibbo-Kobayashi-Maskawa mixing matrix gives 
also rise to a nonzero NEDM, but only beyond the one-loop approximation in the weak interaction, thus 
leading to a very small value [234,410], i.e. |tf n | < 10 _30 ecm, much smaller than the actual experimental 
bound ([T3"]) . 

In order to translate the above experimental bound into a constraint on 0, one needs to compute the 
NEDM in QCD in the presence of a term, i.e. to compute the quantity c n defined as 

d n ~ c n 9 e fm (7-4) 

in the presence of a small term. Calculations have been done within models, such as the bag models [64, 
406,409], the instanton liquid model [241], holographic QCD [317], or by approaches based on a CP violating 
effective chiral Lagrangian and chiral perturbation theory (ChPT) [47,100,135,162,213,341,349,448], on 
QCD sum rules [454,484], etc... Table [4] presents a list of results for c n . 

The present theoretical estimations of the NEDM in the presence of a term significantly vary among 
the different models, approaches and approximations which are considered. It is hard to get an accurate 
estimate from the results reported in Table [4] There is a substantial global agreement on the size of the 
quantity c„, indeed they suggest 

0-001 < |c„| < 0.01, (7.5) 

however, there is not a complete agreement on the sign of c„ . Anyway, these crude estimates of c n strongly 
suggest that is very small: By taking as lower bound |c„| > 0.001, the experimental upper bounds (|7.3[) 
lead to 

|0| <O(10- 10 ). (7.6) 



41 



Table 4: Estimates of the NEDM in the presence of a 9 term. We report the quantity c n defined by writing 
d n = c n Oelm. Results from lattice QCD are reported in the text. 



Rcf. 


year 


approach / model 


C-n 


[64] 


1979 


bag model 


0.0027 


[162] 


1980 


ChPT 


0.0036 


[349] 


1981 


ChPT 


0.001 


[341] 


1981 


ChPT 


0.0055 


[478] 


1982 


ChPT 


0.02 


[409] 


1984 


chiral bag model 


0.0030 


[474] 


1984 


soft pion Skyrmion model 


0.0012 


[130] 


1984 


single nucleon contribution 


0.011 


[448] 


1991 


ChPT 


0.0033(18) 


[135] 


1991 


ChPT 


0.0048 


[47] 


1992 


ChPT 


-0.0072,-0.0039 


[454] 


1999 


sum rules 


0.0024(10) 


[100] 


2000 


heavy baryon ChPT 


-0.0075(32) 


[241] 


2004 


instanton liquid 


0.010(4) 


[317] 


2007 


holographic QCD 


0.00108 



Clearly, an accurate estimate of c„ is essential, to translate the experimental bound on the NEDM into a 
reliable stringent bound on 9. 

The computation of the ratio c n = d n /9 can also be addressed by calculations within the lattice formu- 
lation of QCD, which represents the most fundamental source of theoretical nonperturbative information on 
the strong interactions. The dipole moment of nucleons is obtained from the form factors that parametrize 
the electromagnetic current between nucleon states in the 9 vacuum, 

(p',s'\J^\p,s) = u s ,(p')T»{q 2 )u s (p), (7.7) 

where q = p' — p and T p '(q 2 ) has the most general four-vector structure consistent with the gauge, Lorentz, 
and CPT invariance of QCD, see e.g. [85], 

T»(q 2 ) = F 1 (q 2 )Y + J-F 2 (q 2 )ia^q v + fA(? 2 )( 7 V<; 2 - 2m n7 V) + ^F 3 (q 2 )a^ 75 q„. (7.8) 

F 3 {Q)/2m n gives the electric dipole moment, which vanishes when 9^0. In lattice calculations matrix 
elements can be extracted from the large-distance behavior of appropriate correlation functions in Euclidean 
spacetime. In the case at hand one considers three-point correlations such as 

(N(t 2 , p 2 )J? m (t, $) JV+ fa , ft )> , (7.9) 

where N is the interpolating operator which creates the given nucleon. 

The NEDM can be also extracted by computing the variation of the energy of neutron states in the 
presence of an external electric field [45] . In a CP-violating theory, a static constant electric field E gives 
rise to a change in the energy £ of the nucleon state: 

8£Kd n S-E, (7.10) 

where S is the nucleon spin, and d n ~ 9. An estimate of the NEDM is obtained by measuring the energy 
difference of nucleon states with opposite spin along the electric field, i.e. 

£+ -£- = 2d n S-E + 0(\E\ 3 ). (7.11) 



42 



Although the action with 6* ^ cannot be directly studied by numerical Monte Carlo methods, in the 
small 9 limit one can obtain the desired result for c n , Eq. (|7.4p . by expanding around 9 = and taking only 
the linear term, i.e. for a generic expectation value of product of operators P, 

(P)e^(P)e=o + i0(PQ)e=o (7.12) 

where Q is the topological charge. Alternatively, the nucleon electric dipole at finite 9 can be determined 
using reweighting techniques with the complex weight factor e 10 ® . Another approach, pursued in Ref. [336], 
utilizes MC simulations at imaginary 9 values. 

Lattice calculations have been reported and discussed in Refs. [45,46,85,286,336,485-487]. Several in- 
vestigations [45,46,286,485,486] have been performed within the quenched approximation, i.e. neglecting 
fermion loops, leading to the estimate [486,487] c„ w 0.02-0.04 for the linear coefficient defined in Eq. (|7.4j) . 
using both the form factor and external electric field methods. However, calculations within this approxima- 
tion are not expected to give accurate results because they miss the fact that the effect of the 9 term must 
vanish in the chiral limit, and therefore they may overestimate the ratio d n /9 at the physical small values 
of the u and d quark masses. First results for full QCD have been reported only recently [85,336,486,487]. 
Ref. [85] reported a calculation in full QCD with two dynamical domain- wall fermions, by measuring the elec- 
tromagnetic form factor. However, the precision was not sufficient to estimate the ratio d n /9, but provided 
only the bound 

|c„|<0.02, (7.13) 

which is larger than the calculations reported in TablelH see also Eq. (|7.5[) . In Ref. [487] the NEDM has been 
estimated using the external electric field method, with two dynamical clover quarks, obtaining the estimate 
c„ w 0.04 from the smallest quark mass values considered in the MC simulations. Although this estimate 
comes apparently with a relatively large uncertainty, it is significantly larger than the above-mentioned 
estimates by other approaches, summarized by Eq. (|7.6|) . In particular, it confirms the results obtained 
within the quenched approximation, without showing the expected decrease when approaching the chiral 
limit; the original paper contains a discussion of the possible reasons, which, for example, may be related to 
the fact that the small quark-mass regime (chiral limit) has not yet been reached by numerical simulations. 
Therefore, further work is called for, in order to accurately determine the NEDM in the presence of a 9 term 
from lattice QCD at sufficiently small values of the quark masses. 



7.2 The topological susceptibility in the chiral limit 

The behavior of x as a function of the quark masses can be understood in the framework of the chiral 
effective Lagrangian of QCD, see e.g. Refs. [160,219,372,373,541]. In the case of Nf degenerate flavors, the 
topological susceptibility is expected to vanish at small quark masses m as 

xs f +0(ra, » = w + ° ( ^ (7 - 14) 

where £ = —(t/jip) /Nf. Eq. (|7.14|) does not apply to one-flavor QCD, Nf = 1, where there is no spontaneous 
breaking of chiral symmetry (the anomaly breaks the whole axial symmetry) , thus no Goldstone bosons such 
as pions are expected. Refs. [155, 157] discuss the case of one-flavor QCD, and in particular the meaning 
of its massless quark limit. Note that while m and S are not renormalization-group invariant, since each 
one depends on the renormalization-group scheme and on the corresponding scale, their product mS is a 
renormalization-group invariant quantity. When m — > 0, \ vanishes, consistently with the fact that in the 
presence of a massless flavor the 9 dependence disappears. On the other hand, in the limit of large quark 
masses, the topological susceptibility is expected to approach the nonzero value \qo of the pure gauge theory. 
In the large- N limit, it has been argued that [219,373] 

X Jtt,oo iv1 tt,oc X°o 

which describes the crossover behavior of x from the small- M w and large- M„ asymptotic regions. We recall 
that a relation analogous to (|7.14[) holds also on the lattice, at finite lattice spacing, when fermions satisfying 
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Figure 16: x r o vs m q r o where m q is the MS quark mass at the scale // = 2 GeV. We show data from 
Ref. [188] (squares) and from Ref. [42] (circles), obtained by Monte Carlo simulations of two flavors of 
dynamical overlap fermions (the results of Ref. [42] were obtained at fixed topological charge Q = 0, as 
discussed at the end of Sec. 15 .4[) . The dashed line indicates the quenched result, which corresponds to the 
limit m q ro — ► oo. 



the Ginsparg- Wilson relation are considered, see Sec. 14.2.21 In the case of other discretizations of the Dirac 
operator, such as Wilson and staggered fermions, the relation (|7.14[) is expected to hold only in the continuum 
limit. 

Several numerical simulations have been devoted to the study of the behavior of the topological suscep- 
tibility in the presence of dynamical quarks, see Refs. [21, 32, 40, 42, 48, 62, 79, 83, 85, 92, 94, 173, 187, 188, 221, 
233,294,295,300,301,365,366], and in particular to a check of the asymptotic behavior (|7.14| in the regime 
of small masses. The results obtained using Wilson [21,62,221,295] and staggered [32,79,83,94,173,221,365] 
fermions are substantially consistent with the expected behavior (|7. 14|) and (|7.15[) . However, a conclusive 
quantitative check of Eq. (|7.14[) has not been achieved yet. One should not forget that the numerical check 
of Eq. (|7.14|) is subject to scaling corrections (i.e. it is expected to hold only in the continuum limit), to 
possible systematic errors in the measurement of the topological susceptibility depending on the method 
employed, and also to the fact that the topological modes are very slowly sampled in Monte Carlo simula- 
tions (we shall return to this point in Sec. I10p . Results obtained using chirally improved fermions, such as 
domain wall fermions, can be found in Refs. [48,85,366], and are substantially consistent with a suppression 
of the topological susceptibility in the chiral limit. The use of overlap fermions should simplify the check 
of Eq. (|7.14p . because it suppresses some sources of systematic errors. Although overlap fermions require a 
much larger numerical effort, the check of Eq. (|7. 14|) should benefit from the fact that it holds also at finite 
lattice spacing if one employs the fermionic definition of topological charge based on the index theorem. 
Therefore, it can be done on relatively small lattice sizes, without the need of carefully studying the contin- 
uum limit. The results obtained so far are quite promising [42, 188,233]. They show clear evidence of the 
suppression of x when decreasing the quark mass m, consistently with the prediction of chiral perturbation 
theory. In Fig. [THlwe show some recent results from Refs. [42,188], obtained by Monte Carlo simulations of 
two flavors of dynamical overlap fermions; x r o is plotted versus m q r§ , where r$ is the Sommer scale [503] 
and m q is the MS quark mass at the scale fj, = 2 GeV. The data are nicely consistent with the expected 
linear behavior x ~ m q at small quark masses. 

The behavior of x nas a l so been investigated at finite density, in Refs. [36,400]. 
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7.3 The U(1)a axial anomaly at finite temperature 

Another interesting issue concerns the temperature dependence of the U(1)a axial symmetry breaking, 
and, in particular, the possibility of its effective restoration at finite temperature, for example at the chiral 
transition where the chiral SU(Nf)A symmetry is restored in the massless limit. We recall that the anomaly 
equation holds also at finite temperature because of its ultraviolet origin. 

As already discussed in Sec. 12. 1[ at T = the U(1)a axial anomaly gives rise to a quite large splitting 
in mass of pseudoscalar flavor singlet and nonsinglet mesons, such as rj and T}', which have the same quark 
content. This issue has been investigated on the lattice in Refs. [44,257,284,310,370,394,395,473,505]. The 
results for the ij and rj masses are in substantial agreement with the experimental values. 

The thermodynamics of strong interactions is characterized by a transition at T w 200 MeV from a low-T 
hadronic phase, in which chiral symmetry is broken, to a high-T phase with deconfined quarks and gluons 
(quark-gluon plasma), in which chiral symmetry is restored [345,543]. The nature of the phase transition is 
also sensitive to the symmetry breaking pattern, and therefore to whether the U(1)a symmetry is effectively 
restored at T c [69,449,536]. 

A complete suppression of the anomaly effects at T c is however unlikely in QCD. Semiclassical calculations 
in the high-temperature phase [285] show that the instanton contribution is suppressed for T ^> T Cl implying 
a suppression of the anomaly effects in the high-temperature limit, but not their complete vanishing at finite 
T. In Sec. 16. 41 we have discussed the case of pure SU{N) gauge theories, where the topological susceptibility 
appears to suffer a significant decrease across the transition, but remains nonzero above T c for finite N. 

The effects of the U(1)a anomaly at finite temperature in full QCD can be investigated by comparing the 
behavior of fermionic correlators in different meson channels which are related by the U(1)a symmetry, see 
e.g. Refs. [93, 145, 238, 368, 493]. For example, in the case of two flavors, we have the isoscalar I — scalar 
channel a associated with the operator O a = ipip, the 1=1 scalar channel S associated with 0$ = ipTip, and 
the corresponding pseudoscalar I = rj and I = I ir channels associated with the operators O v > = ip^ip 
and Ott = 'tpTjs'ip, respectively. When the SU(2)l <8> SU(2)r chiral symmetry with two massless flavors is 
realized explicitly, rather than spontaneously broken, it implies degeneracies between mesonic correlators. 
SU(2)a transformations mix the a and tt channels, and the S and rj channels. Therefore, above T c , where 
the SU(2)a symmetry is restored, the correlations in the corresponding channels are expected to become 
identical (of course in the chiral limit only), apart from trivial factors, see e.g. Ref. [354]. On the other hand, 
U(1)a transformations relate correlators in the a and rj channels, and correlators in the 6 and 7r channels. 
Therefore an effective further restoring of the U(1)a symmetry at T > T c would lead to equal correlators in 
all these channels, and in particular, to equal susceptibilities and screening masses. 

The issue has been investigated by lattice Monte Carlo studies, see Refs. [80,132,344,354,538]. Although 
lattice results have been obtained for relatively small lattice sizes and convincing checks of scaling in the 
continuum limit have not been reported, they favor the scenario in which the U(1)a symmetry is not restored 
at T c . Fig.[T7l taken from Ref. [344], shows results for the susceptibilities of different channels. Similar results 
have been obtained using domain wall fermions in Ref. [538]. The screening masses in the a and 7r appear 
equal above T c signaling that the chiral symmetry SU(2)l ® SU(2)r is restored above T c . The effective 
breaking of the axial U(1)a symmetry appears substantially reduced above T Cl but the difference in the S 
and 7r channels, and in particular in their screening masses, although small above T c , appears significantly 
different from zero at least up toTs 1.2T C . At even higher temperatures, we expect that this difference 
decrease smoothly to zero, as indicated by semiclassical instanton calculations [285]. 

The U(1)a symmetry breaking at finite temperature has also been discussed within the instanton liquid 
model [471,472] , leading to analogous conclusions. Its effects have been also investigated within the AdS/CFT 
framework in Refs. [78,470]. 

The phase structure of the theory above T c , and in particular a possible effective restoration of the U(1)a 
symmetry above T c , has been investigated in Refs. [392,396,398], by assuming the existence of quark conden- 
sates which break the U(1)a axial symmetry, but are invariant under the symmetry SU {Nf)y <£> SU {N/)a ® 
U(l)v, such as the vacuum expectation value of the 2A/-quark operator det(ipR,iipL,j) + det (t^L,it^R,j) 
(where i,j are flavour indices) and appropriate extensions, which may play a relevant role above T c where 
the bilinear quark condensate (tptp) vanishes. 
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Figure 17: Susceptibilities in different channels for 2-ffavor QCD with staggered fermions on lattices of size 
8 3 x 4, versus [3. The vertical line indicates the value of (3 at the transition. Taken from Ref. [344]. 

Table 5: This Table summarizes the predictions by renormalization-group analyses based on universality 
arguments on the nature of the finite-temperature transition of QCD, for different numbers Nf of light 
flavors, and in the case the effects of the U(1)a axial anomaly are relevant or effectively suppressed. In those 
cases where a continuous transition is possible, the corresponding universality class is reported, and indicated 
by the corresponding symmetry breaking pattern. See Refs. [69,536] for more details and references. 





U(1)a anomaly 


suppressed anomaly at T c 


Nf 


SU(N f ) L ® SU(N f ) R -> SU{N f ) v 


U(N f ) L ® U{N f ) R -> U{Nf) v 


N f = 1 


crossover or first order 


0{2)/Z 2 or first order 


N f = 2 


0(4)/0(3) or first order 


[U(2) l ®U(2)r]/U(2) v or first order 


N f >3 


first order 


first order 



7.4 U{1)a breaking and the nature of the finite-temperature chiral transition 

Our understanding of the nature of the finite-T phase transition in the chiral limit is essentially based on 
the relevant symmetry and symmetry-breaking pattern. In the presence of Nf light quarks, the relevant 
symmetry is the chiral symmetry SU{Nf)i J ® SU(Nf) R . At T = this symmetry is spontaneously broken 
to SU(Nf)v with a nonzero quark condensate ('ipip)- The finite-T transition is related to the restoring of 
the chiral symmetry. It is therefore characterized by the symmetry breaking 

SU(N f ) L ® SU(N f ) R -> SU(N f ) v . (7.16) 

An effective restoring of the U(1)a axial symmetry at the finite-temperature transition can have important 
consequences for the nature of the transition in the chiral limit [449]. Indeed, if the effects of the axial 
anomaly are effectively suppressed at the transition, instead of the symmetry breaking pattern (|7.16[) . we 
would have 

U(Nf) L ®U{Nf) R ^U{N f ) v . (7.17) 

Table [S] lists the possible transitions as predicted by renormalization-group analyses based on universality 
arguments [68,69,115,449,536], for various values of Nf, and for the two symmetry breaking patterns (|7.16p 
and (|7.17p . In those cases where a continuous transition is possible, the corresponding universality class is 
reported. 

In the physically relevant case of Nf = 2 light quarks, renormalization-group arguments show that the 
chiral transition can be continuous, and, in this case, its critical behavior belongs to the 3D 0(4) vector 
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universality class (essentially because the expected symmetry breaking SU(2)l <8> SU{2)r — ► SU(2)v is 
equivalent to 0(4) — > 0(3), as in the 0(4) vector model), characterized by the critical exponents v — 0.749(2) 
and 77 = 0.0365(10) [296,445]. The transition can be continuous also in the case the U(1)a symmetry is 
effectively restored at T c , but in a different [ U(2)l <8> U {2)r ] / U{2)y universality class [68, 69] , characterized 
by the critical exponents v 0.7 and }j w 0.1. As we have discussed above, this should not occur in 
three-color QCD, but it may be relevant in the large-JV c limit, where the anomaly gets suppressed. We also 
recall that the existence of a corresponding universality class does not exclude that the transition may be 
first order, when the system is outside the attraction domain of the stable fixed point of the RG flow which 
describes the universality class. 

The nature of the transition in QCD can be investigated within the lattice formulation of QCD. Many 
studies based on MC simulations of different lattice formulations of QCD have been performed, see e.g. 
Refs. [20,81,200,235,346,355,356], but this issue is still controversial. Some MC results favor a continuous 
transition. However, the results have not been sufficient to settle its 0(4) nature yet. There are also 
claims [200] in favor of a first-order transition. Unlike Nf = 2 QCD, the transition scenario appears settled 
for Nf > 3: MC simulations [136,176,177,334,346] show a first-order transition, in agreement with the RG 
predictions. 

The effects of the U(1)a anomaly on the chiral transition have been numerically investigated in Ref. [133] 
using strongly coupled lattice QED with two flavors as a model for the chiral transition in two-flavor QCD. 
In this model the transition appears first order in the absence of terms breaking the U(l) symmetry corre- 
sponding to the U(1)a symmetry of QCD. Then, the explicit breaking of this U(l) symmetry weakens the 
transition, and turns it to an 0(4) continuous transition for a sufficiently large strength of the U(l) symmetry 
breaking. These features are consistent with one of the different scenarios allowed by RG arguments, which 
are listed in Table [5] However, these results cannot be extended to QCD, because the actual realization of 
one of the scenarios allowed by the RG arguments, i.e. the fact that the transition is first order or continuous 
(when the universality class exists), depends on the details of the model under investigation, not only on the 
global features which characterize the given universality class. 

In nature quarks are not massless, although some of them, the quarks u and d, are light. The physically 
interesting case is QCD with Nf = 2 light quarks and four heavier quarks (in particular the quark s with 
m s w 100 MeV). Therefore, it is important to consider the effects of the quark masses in the above transition 
scenarios. Renormalization-group arguments show that, if the transition is continuous in the chiral limit, 
then an analytic crossover is expected for nonzero values of the quark masses m/, because the quark masses 
act as external fields coupled to the order parameter. On the other hand, a first-order transition is generally 
robust against perturbations, and therefore it is expected to persist for mf > 0, up to an Ising end point. 
Actually, the presence of the massive strange quark s makes the above scenario more complicated, because 
the nature of the transition may be sensitive to its mass m s . Since the transition is expected to be first order 
in the chiral limit of Nf = 3 degenerate quarks, see Table [5j we also expect that the first-order transition 
persists for sufficiently small values of m s . On the other hand, if the transition is continuous in the limit 
m s — > 00 corresponding to Nf = 2 degenerate quarks, then there must be a tricritical point at m s = m* 
(where the critical behavior is as described by mean field theory, apart from logarithms) separating the 
first-order transition line from the 0(4) critical line. 

For quark masses around their physical values, the results of MC simulations, see Refs. [49, 82, 137, 
177], support a crossover scenario, i.e. the low-T hadronic and high-T quark-gluon plasma regimes are not 
separated by a phase transition, but rather by an analytic crossover, where the thermodynamic quantities 
change rapidly in a relatively narrow temperature interval. Nevertheless, the nature of the transition in the 
chiral limit is still of interest. Since the physical masses of the lightest quarks u and d are very small, some 
scaling relations, valid in the chiral limit, may still hold at the physical values of the quark masses. 
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8 6 dependence in the two-dimensional CP N 1 models 

8.1 The two-dimensional CP N ~ l model as a theoretical laboratory 

Issues concerning the 9 dependence can also be discussed in two-dimensional CP^ -1 models [166,548], whose 
Lagrangian is given by 

N 

C = —D^zD^z, (8.1) 
^9 

where z is a complex iV-component scalar held subject to the constraint zz = 1, 

Afj = izd^z (8.2) 

is a composite gauge held, and = + iA^ is a covariant derivative. They provide an interesting 
theoretical laboratory. Indeed they present several features that hold in QCD: asymptotic freedom, gauge 
invariance, existence of a confining potential between non gauge invariant states (that is eventually screened 
by the dynamical constituents), and nontrivial topological structure (semiclassical instanton solutions, 9 
vacua). We recall that the CP 1 model is equivalent to the 0(3) a model, as can be easily seen by a change 
of variables. Indeed, in the case N = 2, the action can be written in a 0(3)-symmctric form by reexpressing 
the z held in terms of s = zaz where <ii are the Pauli matrices. 

An appealing feature of 2D CP N ~ l models is the possibility of perfoming a systematic 1/N expansion, 
keeping g hxed, around the large- N saddle-point solution [122,123,166,548], unlike 4D SU(N) gauge theories. 
This makes these models particularly interesting, because they allow us to also check general nonperturbative 
scenarios by analytic calculations, without necessarily resorting to numerical Monte Carlo methods of their 
lattice formulation. 

Analogously to 4D SU(N) gauge theories, one may add a 9 term to the Lagrangian, writing 



iV — — — „ 1 

— D,,z D,,z — i9 — 
2g 2vr 

where 



Ce = —D^zD^z - iO—e^d^Av, (8.3) 



1 „ . i 



q{x) = — e M „ du,A v = — D^zD v z (8.4) 

Z7T ZTT 

is the topological charge density. Then one may study the 9 dependence of the ground state and other 
observables. As in 4D SU(N) gauge theory, q{x) is a renormalization-group invariant operator. 

8.2 The ground-state energy 

By analogy with SU(N) gauge theories, the ground state energy F(9), dehned as in Eq. (|1.4p . depends on 
9. One may dehne a dimensionlcss function f{&) related to the ground state energy, 

f(9)^e{F(9)-F(0)}, (8.5) 

where £ = £(6 — 0) is a length scale dehned at 9 = 0. £(9) can be dehned from the second moment of the 
two-point correlation function Gp(x — y) = (Tr P(x)P(0)) of the operator P{j(x) = z i (x)zj(x) 1 i.e. [f] 

, x9 [d 2 x \x 2 G P {x) , x 

J d z x Gp{x) 

The scaling function f{9) can be expanded around 9 = as 

f{e) = \c6 2 L + J^b 2n 6 2 \ (8.7) 



3 The second-moment correlation length £ of Gp(x) turns out to be more suitable for a l/7V-expansion than the length scale 
£tu (inverse mass gap) determined from the large-distance exponential decay of Gp(x), i.e. Gp(x) ~ e 1 33 1 / Cuj a t large due 
to its analytical properties in 1/N [121, 122]. While using £ leads to an expansion in powers of 1/N, the use of £ w gives rise to 
nontrivial power corrections: Indeed, in the large- N limit £/£w = \/2/3 + 0(7V -2 / 3 ). 
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where C is the dimensionless quantity x£ 2 at 9 — 0, and \ is the topological susceptibility 



X = / d 2 x(q(0)q(x)). (8.8) 



8.3 9 dependence in the large-iV limit 

The 9 dependence of the theory can be investigated within the 1/N expansion. In the large- N limit we 
have [121,166,381,548] 

m = A^N 02 + 0{1/N2) (8 ' 9) 



for \9\ < ir, thus the topological susceptibility is 0(1/ N), 

= Xe = 7^T- T +0{l/N 2 ). (8.10) 



c _d 2 f(6) 



de 2 



2ttN 

8=0 



As with 4D SU(N) gauge theories (see Sec. 13.21 and in particular Eqs. (|3.20|) and (|3.2ip ). the 2w periodicity 
of the 9 dependence must give rise to a cusp at 9 = n, and therefore to a singular behavior, which will be 
further discussed in Sec. 18.51 

The coefficients b 2n of the expansion (|8.7[) are obtained from appropriate 2n-point correlation functions of 
the topological charge density operators at 8 = 0. The analysis of the 1/iV-expansion Feynman diagrams [122] 
of the connected correlations necessary to compute b 2n shows that they are suppressed in the large- N limit, 
as [192] 

b 2n = 0(l/N 2n ). (8.11) 
Rather cumbersome calculations lead to the results [192] 

27 ■ 0(l/N% &4 = -^+0(l/iV 5 ). (8.12) 



57V 2 w " 7N 4 

The above results are consistent with large- N scaling arguments applied to the Lagrangian (|8.ip . which 
indicate that the relevant 9 parameter in the large- TV limit should be 9 = 9/N. This implies that the 
ground-state energy can be rewritten as 

f(9)=Nf(9 = 9/N), (8.13) 
1 n=l 

where C = NC and fan = N 2n b2 n are O(N ). Note the analogy with the expected 9 dependence of the 
ground-state energy in 4D SU(N) gauge theories, cf. Eq. (|3.16j) . 

Within the 1/N expansion one may also study the dependence of the mass M(9) = ^(9)^ 1 on the 
parameter 9. We write 

M{9) = = M (l + m 2 9 2 + ...) . (8.14) 

The analysis of the corresponding diagrams in the 1 /N expansion indicates that m 2 is suppressed as 

m 2 = 0{l/N 2 ). (8.15) 

Once again, the relevant parameter is seen to be 9 = 9/N. 

Concerning the relevance of semiclassical instanton solutions in the context of the 1/N expansion in 
(jpN-i moc i e i Sj it was shown in Ref. [339] that, at the quantum level, instantons appear in the form of poles 
of the effective action, instead of stationary points, and that the 1/N expansion and the semiclassical method 
correspond to two alternative contour integrations of the path integral. The problem of the summability of 
the instanton contributions [245] has been discussed in Ref. [171]. The compatibility of instanton models 
and large- N expansion has been also addressed in Refs. [204, 350] . 
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8.4 Lattice calculations at 9 = 



2D CP N 1 models have also been studied by exploiting lattice techniques, based on the straighforward 
discretization of the continuum action [217] (in the following we set the lattice spacing a = 1) 

S L = -NpY,\^+^n\ 2 (8.16) 

or the lattice action [76,217,457]: 

S\ = — (z n +^z n \ n ^ + z n £ n +^A ni(tl — 2) , (8-17) 

n . fj, 

where /3 = 1/g, z n is a complex TV-component vector, constrained by the condition z n z n = 1, and X n ,p, is 
an auxiliary U(l) variable. The lattice CP W_1 model represents a very useful theoretical laboratory for 
testing the various numerical methods to determine topological properties, and in particular the topological 
susceptibility, also because the results can be compared with the analytic large- N calculations. 

A lattice topological charge density operator ql, having the correct classical continuum limit q L (x) — 
a 2 q(x) + 0(a 4 ), can be defined as 

= -^E e ^ Tr [P(x)AlP(x)AlP(x)] , (8.18) 

where A s is a symmetrized version of the finite derivative: A s fi P(x) = ^[P(x+fx) — P(x—fj,)], and Pij(x) = 
Zi(x)zj(x). Then, one may define the corresponding lattice susceptibility xl = (52 x qL(0)qL(x)}. Analo- 
gously to the 4D SU (N) gauge theories, see Sec. 15.21 the relation with the continuum topological susceptibility 
is given by 

XL = a 2 Z 2 LX + B, (8.19) 

i.e. it involves a multiplicative renormalization and a background term which eventually becomes dominant 
in the continuum limit. Some pcrturbative calculations can be found in Refs. [205,242]. 

One can also define the topological charge by a geometrical approach. The original geometrical construc- 
tion for the topological charge proposed in Ref. [76] is 

Qg ^ ' Qn i 

n 

q n = -^Im[lnTr(P„ +A1+ ,P„ +Al P n )+lnTr(P„ + ,P n+Al+ , y P„)], /i/v, (8.20) 

where the imaginary part of the logarithm is to be taken in the interval (— vr, tt). For the lattice formulation 
(|8.17p an alternative geometrical definition Q 9: \ can be given in terms of the "gauge" field \ n ,^ [124]. 
Introducing the plaquette operator 

U\,n = A„ J ^ n +i£^n+2,i^n,2' (8-21) 

one defines Q g> \ = J2 n ?A,n where u\, n — exp(i27r<?A,n) and q\, n G (—1/2, 1/2). On a finite volume and for 
periodic boundary conditions q n and q\^ n generate integer values of the total topological charge for each 
configuration. They are not analytical functions of the lattice fields z and A, and fail to be defined on certain 
"exceptional" configurations, which constitute a set of measure zero. 

Fermionic constructions of the lattice topological charge density, based on the overlap Dirac operator, 
have been considered in Refs. [13,374]. 

The topological properties of 2D CP^ -1 models have been investigated by a large number of studies 
exploiting lattice techniques, see Refs. [29, 55, 70, 75, 77, 95, 113, 114, 124, 125, 193, 201, 202, 299, 331, 338, 350, 
374,393,450,451,458,475,533,535,554]. A wide range of values of N has been considered, both small and 
large, in order to test large- N calculations. The present state of art is briefly summarized in the following. 

Many numerical works, see Refs. [75,77,95, 124,201,202,205,243,393], have been dedicated to the N = 2 
case, which also corresponds to the 0(3) nonlinear a model. The most recent simulations using the so-called 
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1/N 



Figure 18: MC results for the scaling ratio x£ 2 versus 1/N. Results indicated by crosses are taken from 
Rcf. [193] (N = 10,15,21, obtained by the geometrical method), by a circle from Ref. [533] (N = 41, 
geometrical method), by a box from Ref. [125] (JV = 4, heating), by a downward pointing triangle from 
Rcf. [113] (N — 4, obtained using also [125] = 0.985(4) where £ w is the length scale extracted from 

the exponential long-distance behavior of the pj correlation function), by a right triangle from Ref. [29] 
(N = 10, heating) and by a left triangle from Ref. [29] (N — 10, cooling). The line shows the asymptotic 
large- N behavior x£ 2 = 1/(2ttN). In some cases the data are slightly shifted along the x-axis to make them 
visible. 



classical perfect action [95,202] favor what is suggested by semiclassical arguments, that is that \ would not 
be a physical quantity for this model, in that a nonremovable ultraviolet divergence affects the instanton 
size distribution. The manifestation of this property on the lattice would be that lattice estimators of x do 
not properly scale approaching the continuum limit. 

This problem should not affect the topological properties of CP^" 1 models for N > 2. Monte Carlo 
results can be found in Refs. [13, 29, 113, 124, 125, 193, 298, 299, 338, 446, 458, 533, 554]. They have been 
obtained using the various methods discussed in Sec. 15.31 In particular, at large N, say N > 10, the 
geometrical estimator (|8.20l) shows scaling already at reasonable values of the correlation length [125,193,533]. 
Some results for the dimensionless quantity x£ 2 at iV > 4 are shown in Fig. [THJ For N = 10, there are 
accurate estimates of the continuum limit of x£ 2 by different methods: x£ 2 — 0.0175(3) using the geometrical 
method [193], x£ 2 = 0.0169(7) and x£ 2 = 0.0169(4) using respectively heating and cooling [29], which are in 
good agreement. The results clearly approach the large- TV asymptotic behavior (|8. 10[) . Deviations from this 
behavior are seen to be quite small, and clearly suppressed at large N. Writing 

y£ 2 = — !— + % + 0(1/N 3 ), (8.22) 
^ 2ttN N 2 V 1 h V ' 

the MC results suggest a small 0(1/N 2 ) coefficient: e-i « 0.15. This number can be compared with the rather 
cumbersome 0(1/ N 2 ) calculation in the framework of the large- N expansion [121], which confirms that the 
0(1/ N 2 ) correction is small, but gives the number ~ —0.060. The origin of this apparent discrepancy 
remains unclear. 

Numerical MC investigations of topological charge structure in 2D CP N ~ X models are reported in 
Refs. [13,350,374], finding evidence of extended coherent structures, such as instantons and domain walls, 
which led to a qualitative picture of the relevant vacua, and of the changes with increasing N; they point 
to a transition from an instanton dominated vacuum, at small values of N, to a domain wall dominated 
vacuum at large N. 
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8.5 Results around 9 = n 

The 8 dependence of CP W_1 models, and in particular around 9 = tt, is physically relevant in condensed 
matter physics. The CP 1 or 0(3) a model at 8 — tt should describe the antiferromagnetic spin 1/2 chain [6, 
291], which is gapless. Thus the mass gap in the 2D 0(3) a model is expected to vanish when 8 — > tt [7-9]. 
The spectrum around 9 = tt has been studied in Ref. [150]. The behavior of CP N ^ models around 9 = tt 
has also been discussed in connection with the theory of the quantum Hall effect, see e.g. Ref. [455] and 
references therein. It has been conjectured [5, 7] that the ground-state energy F{9) of CP N_1 models with 
N > 2 presents a discontinuity at 9 = tt, like the large- N limit [475]. We also mention the quite general 
scenario argued in Ref. [58]: assuming a nontrivial 9 dependence and quantization of the topological charge, 
the theory either breaks spontaneously CP at 9 — tt or shows a singular behavior at some critical 9 C between 
and tt. 

Several lattice studies have also investigated the behavior of the theory at finite 9 and not only around 
9 = 0, see e.g. Refs. [38,54,56,59,70,89,114,350,437,450,451,475]. Like 4D SU(N) gauge theories, 
the Euclidean action with the 9 term is complex, which impedes a Monte Carlo simulation of the theory 
using its straightforward discretization, i.e. by adding the 9 term to the lattice formulations (|8.16p or 
(|8.17[) . Essentially two approaches have been pursued. In one of them the topological charge distribution is 
determined at 9 — by Monte Carlo simulations, and then it is used to estimate quantities at finite 8. In an 
alternative approach one gets information of the finite-^ behavior by performing Monte Carlo simulations 
at complex values of 9 where the action is real, and then somehow extending the results to real 8 values. 
The numerical studies [38,59,89] of the critical behavior of the CP 1 or 0(3) a model have confirmed the 
theoretical expectation of a gapless theory at 6 = w. Results for CP N ^ models with larger values of N, 
see Refs. [56,70,114,451,475], confirm a smooth 9 dependence up to 9 — tt, where there is a singularity, like 
the one found in the large- N limit. In particular, in Ref. [70] the sign problem was overcome by simulating 
an appropriate SU(N) quantum spin ladder equivalent to a CP N ~ X model with 9 — tt; the results provided 
evidence of a first-order transition at 9 = tt (discontinuity in the ground-state energy F(9)) with spontaneous 
breaking of charge conjugation symmetry for CP^ -1 models with N > 2. Ref. [350] used fractionally charged 
Wilson loops to infer properties related to the 8 dependence, providing further support to the first-order 
transition scenario at 8 = tt for sufficiently large values of N. 



9 The two-point correlation function of the topological charge 
density 

The two-point correlation function 

G(x - y) = (q(x)q(y)) (9.1) 

of the topological charge density presents a peculiar behavior, especially if contrasted with the physical 
renormalization-group invariant meaning of its positive integral, i.e. the topological susceptibility. Reflection 
positivity implies G(x) < for |jc| > 0. On the other hand, the susceptibility must be positive, trivially 
from its definition. These facts indicate that there is a positive contact term at x = 0, that contributes to 
the determination of the physical quantity \- I n this section we discuss the main features of the correlation 
function G{x). 

9.1 General features of the two-point function and reflection positivity 

In order that Euclidean correlation functions can be continued back to Minkowski space, they have to obey a 
positivity condition: the so-called reflection positivity [438,439]. The general statement concerning reflection 
positivity is that 

((QF)F) > 0, (9.2) 

where O is the antilinear reflection operator consisting in a Euclidean time reflection and a complex conju- 
gation, and F is an arbitrary gauge invariant function of the fields having support only at positive Euclidean 
times (see also Ref. [404]). As a consequence of the intrinsic odd parity of q(x) under reflection, 

<dq(xi,x 2 ,xz,x 4 ) = -q{xi,x 2 ,x 3 ,-X4), (9.3) 
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reflection positivity states that [476,477,535] 



G(x) < for |i| > 0. (9.4) 

This fact holds for any operator that is intrisically odd with respect to reflection symmetry in the Euclidean 
space. 

The asymptotic large- and small-distance behavior of G(x) can be inferred by general arguments. At 
large distance G(x) should decay exponentially as er m \ x \ where m is the lowest mass in the corresponding 
h channel. In the presence of fermions we expect G(x) ~ e _r7 Vl x l apart from negative powers of 
Dimensional, perturbative and renormalization group arguments [535] tell us that for r —* 



G(x) = 



(lnr) 



i + o [ -i- 

In r 



(9.5) 



where c is a negative constant. The logarithms can be related to a running coupling constant; indeed, in 
perturbation theory G(x) is 0(g 2 ). Since the topological susceptibility x is positive (\ — m the presence 
of a massless fermion) and G(x) < for i^O, G(x) should develop a positive diverging term at x = 0, that 
compensates the negative contribution of its integral for i^O and makes % positive. Inspite of this singular 
short-distance behavior, the low-momentum behavior and in particular the moments of G{x), 

= f d d x(x 2 ) n G(x) (9.6) 



M2r, 

(/Jo = x)> are expected to be well defined and finite. Below we will see in a solvable model, the large- N limit 
of the two-dimensional CP 1 ^^ 1 model, how these features can coexist. 

It is worth mentioning that, besides the topological susceptibility, also the zero-momentum slope x' is 
of phenomenological relevance, in connection with the spin content of the proton [422,423,490]. In the 
Minkowskian space-time the Fourier transformed correlation function of the topological charge density is 
given by 

X (p 2 ) = -%! d d xe i ^(0\Tq(x)q(0)\0), (9.7) 



X — x(0) is the zero-momentum topological susceptibility, while the so-called zero-momentum slope is defined 



as 



X'^-^X(P 2 )\ P =0. (9.8) 



In Euclidean space it corresponds to 



X' = ^ 2 , (9.9) 

where [12 is the second moment of the Euclidean correlation function G(x), cf. Eq. I|9.6[) . x' nas been 
determined within chiral perturbation theory, obtaining [372] 

»irf(E^)(E£r- <m 

f ! f 1 

For Nf — 3, and inserting the phenomenological values of the mass ratios [371], Eq. (|9.10|) gives x' ~ 
(47MeV) 2 . Consistent results have also been obtained by QCD sum rules [330]. x' nas been also estimated 
in the massless limit [421-423] obtaining x' — [26.5(3.1) MeV] 2 . There are also results for the theory 
without quarks, i.e. the pure gauge theory. An analysis [420, 421] based on the QCD sum rules gave 
x' ~ —[7(3) MeV] 2 . Lattice studies of this quantity have been reported in Refs. [102,105,207]. 



9.2 The two-point function on the lattice 

In the lattice formulation of a theory one may define two versions of reflection symmetry that are equivalent 
in the continuum limit: site- and link- reflection symmetry. We recall that reflection positivity is essential on 
the lattice for the existence of a self-adjoint Hamiltonian at finite lattice spacing defined from the transfer 
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Figure 19: Results (indicated by boxes) for the two-point function (gi(O)gi(r)) of lattice topological charge 
densities versus the distance r, determined using HYP smearing techniques, computed from 2+1 flavor 
dynamical configurations using staggered fermions. From Ref. [79]. 



matrix. Site- and link-reflection symmetries are both satisfied by the Wilson lattice action of SU (N) gauge 
theories [439], and by Wilson fermions with Wilson parameter r = 1 [399]. So Eq. (|9.2|) must hold also on 
the lattice for finite lattice spacing. A lattice discretization of the topological charge density operator is for 
example that given in Eq. (15. 2|) . One may easily verify that qh(x) transforms as q(x), cf. Eq. (|9.3p . under 
site reflection. Again, reflection positivity tells us that 

G L (x) = (q L (x)q L {0)) < 0, for x ? (9.11) 

(at least for x lying along the lattice axes and when there is no overlap among the link variables of the two 
operators). In the continuum limit the lattice correlation Gl(x) should reproduce the continuum correlation 
function G(x). 

The two-point function of the lattice version of the topological charge density has been numerically 
investigated by Monte Carlo simulations in Refs. [34,79,103,164,300,321,325,329,520]. Their results show 
the expected behavior: they are negative at sufficiently large distances, i.e. when the effect of the overlap 
between the links involved in the definition of the lattice density disappears. This is clearly observed in 
Fig. [T9l obtained in Ref. [79] from Monte Carlo simulations with staggered fermions, and the HYP smoothing 
technique to determine the topological charge density [181,302]. Similar results have been obtained by 
employing other lattice definitions of the topological charge density [34,164,300,321,325,329,520], such as 
the overlap definition in pure SU (3) gauge theory. 



9.3 Analytic results in the large-iV limit of the 2D CP 1 model 

The arguments of the previous section can be applied to the two-dimensional CP 1 ^^ 1 models as well. The 
topological charge density q(x) has been defined in Eq. 18.41 Like the topological charge density of QCD, 
q(x) transforms as 

Qq(x 1 ,x 2 ) = -q(xi,-xa) (9-12) 
under reflection symmetry. Therefore, as a consequence of reflection positivity, 

G(x -y) = (q(x)q(y)) < for x - y ? 0. (9.13) 

The two-point function G(x) of q(x) can be computed in the large-iV limit, in the continuum and on the 
lattice. These analytical results show explicitly that G(x) develops a singular behavior at the origin consis- 
tently with the reflection positivity requirement G(x) < for x ^ 0, and the positivity of the corresponding 
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topological susceptibility, i.e. of its space integral. Nevertheless, the low- momentum behavior of G(x) turns 
out to be well defined without the need of special subtractions. This provides an explicit example where the 
conjectured main features of the two-point function G(x) of QCD are analytically verified. 



9.3.1 The large-iV limit in the continuum 



Straightforward calculations in the large- N limit lead to the following expression for the Fourier transform 
of G{x) [535]: 



NG(p) = ±-p 2 



u(p) In 



u(p) + 1 
u(p) - 1 



(9.14) 



u{p) 



1 



3p 2 £ 2 ' 

where £ is the length scale defined in Eq. (|8.6[) . The scaling function 

B{k)=eNG{p = k/£), 

has the following asymptotic behavior 

k -> oo : B(k) = 
k -> : B(k) = 



2vr(ln(6fc 2 ) - 2) 
1 3 .„ 



O 



1 

hTfc 



27 



2tt IOtt 



350tt 



fc 4 + 0(fc 6 ). 



(9.15) 

(9.16) 
(9.17) 



The singular behavior of G{x) at small distance is already apparent from the asymptotic behavior (|9.16[) of 
its Fourier transform. 

The calculation of the large- N limit of G(x) requires performing the Fourier transform of the expression 
(|9.14[) . As before, let us introduce the dimensionless quantities x = x/£, and: 



c{x) = ecm 



The moments of C(x) 



d 2 k 



fx 2n = / d z ~x {x z ) n C{x) 



a B(fc). 



(9.18) 



(9.19) 



and therefore of G{x), can be easily obtained from the expansion of B(k) in powers of k 2 , cf. Eq. (|9.17f) . In 
the large- TV limit one finds 



Mo = x£ 



2nN 



M2 



57TiV 



° [ W 2 



(9.20) 
(9.21) 



and so on. By rotational invariance, C(x) depends only on r = \x\. For r > 



1 f°° 

C(x) = J ^ dtK (tr)t 3 v(t) 



^ )ln T ± ^y- 2 5 + 712,11)2 



(9.22) 



where v(t) = yX — 2/(3i 2 ). Since the Bessel function K$(x) satisfies Kq(x) > 0, Eq. (|9.22p shows that 
C(x) < for r > as expected. In Fig. [20] we show the function C(x). The integral representation (|9.22|) for 
C(x) holds only for r > 0. For r = the contour rotation leading to the integral representation (|9.22[) misses 
the contribution from the path at infinite distance, which is not suppressed anymore and should represent 
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Figure 20: Plot of In [— C(x)] versus r = \x\. 



the positive contact term (see Ref. [535] for details). The integral representation (|9.22[) allows us to derive 
the asymptotic behavior of C(x). At large distance C(x) decays exponentially: 



For r —> 0, C(x) diverges as 



C(x) 



C(x) = - 



1 e" 



24tt r 2 



1 



1 



1 + 



1 + 



1 

lnr 



(9.23) 



(9.24) 



2tt 2 r 4 ( mr ) 2 

Note that one could have obtained this short-distance behavior by calculating the leading order of pertur- 
bation theory, which is given by 



C(x) 
, on< 

7(r) 



9 



2n 4 r 4 ' 

Then, using renormalization group arguments, one replaces the coupling g with a running coupling constant 

7T 



ln(l/rA)' 



(9.25) 
nstant 

(9.26) 



thus recovering Eq. (|9.24|) . 

The diverging negative integral of C(x) for r > must be compensated by a diverging positive contribu- 
tion of the contact term at r — 0, so that 



dfx C(x) = B(0) 



2tt 



(9.27) 



Notice that, as a consequence of the r short-distance behavior and the positivity of Xj w e have formally 



lim 

8^0 



d x C(x) = oo. 



(9.28) 



|x|<5 



Thus (5-like distributions cannot represent a contact term with such singular properties. The behavior at 
x = should be described by more complicated distributions, which act on a finite interval around x — [535] , 
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Table 6: The large-TV limit of NG L (0), C L (x, 0) = £, 4 NG L (x 1 = x£, x 2 = 0) and £, 2 N X l for various values 
of £. For £ = oo the continuum results are recovered. 





(3 


NG L (0) 


Cl (1/4,0) 


Cl(1/2,0) 


Ci(l,0) 


Cl(2,0) 




l 


0.41829... 


0.5371 






-0.07888 


-0.001376 


0.199791 


2 


0.52868... 


0.2751 




-0.8148 


-0.01634 


-0.001257 


0.171593 


4 


0.63901... 


0.1759 


-9.314 


-0.1713 


-0.01615 


-0.001160 


0.162993 


8 


0.74933... 


0.1277 


-1.790 


-0.1728 


-0.01547 


-0.001136 


0.160316 


1G 


0.85965... 


0.09995 


-1.808 


-0.1662 


-0.01535 


-0.001131 


0.159498 


00 


00 





-1.718 


-0.1651 


-0.01532 


-0.001129 


0.159155 



see e.g. Ref. [537] for a standard mathematical text. An example of such distributions may be 



lim 



£^0 + 



P £ (d)6(x) - \ £ -^f (\n\x\)0(x - e) 



(9.29) 



where the polynomial P s (p) and X E are appropriate functions of s and the limit e — > must be considered 
in a weak sense, i.e. after performing the integral with the test function. Analogous considerations apply to 
QCD. 

In Refs. [10, 11,63] the behavior of the two-point correlation function of the topological charge density is 
discussed within the two-dimensional CP 1 / 0(3) a model. In particular, it has been shown [11] that one-loop 
perturbative calculations are consistent with the positivity requirements for such a correlation function. 



9.3.2 The large-N limit on the lattice 

It is also interesting to understand how the peculiar behavior of G{x) is recovered in the continuum limit 
of lattice models, and, in particular, how the short-distance diverging behavior for x — > and the x = 
distribution-like behavior is obtained from the lattice calculations where everything is finite. We consider 
the lattice action (|8.17[) which is more convenient for a large- N expansion [123,217,218]. One can easily 
prove that site- and link-reflection positivity holds for the lattice action f|8 . 1T|) . 

In an infinite lattice (free boundary conditions are assumed) one may consider the following discretization 
of the topological charge density operator 

QL(n) = —e^iOn,^ + &n+u,u — 9n+v,u — #n,i/)> (9.30) 
47T 

where 9 n ,^ is the phase of the field X n>fl , i.e. X n ^ = e 46, " fl , introduced in the lattice formulation (|8.17[) . Using 
the property of qL{n) under site reflection, one can prove Eq. (I9.11|) . At large N one can explicitly show 
that gi(n) has the correct continuum limit, and no lattice renormalizations are necessary [218,458]. Thus 
in the continuum limit and for \x\ > one expects 

C L (x) = ?NG L (xO = C(x) + 0(C 2 ), (9.31) 

where Gl(x — y) = (qL(x)QL(y)}, and C{x) is the continuum function defined in Eq. (|9.18[k £ the second- 
moment correlation length associated with the lattice correlation function Gp{x) = (Tr P(x)P(0)) , defined 
as in Eq. (f8"H|) . 

Gl{x) can be calculated in the large- N limit [535]. In Table [6] we report some results. In the following 
we list some interesting features arising from these lattice calculations, 
(i) The continuum limit of ^ 2 Gi(fc) at fc£ fixed is B(k£), cf. Eq. (|9. 15|) . Indeed at large £ 

eG L (k) = B(k0+O(C 2 )- (9.32) 

This may be proved by performing an asymptotic expansion of Gi,(fc) (at fixed fc£) in powers of £ -2 , following 
the procedure outlined in Ref. [123]. 
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(ii) Gl(x) is negative everywhere for x 7^ 0, consistently with reflection positivity. 

(iii) At fixed physical distance r — x/£ > the continuum limit exists and it is given by C(r), in agreement 
with Eq. (|9.31|) . Notice that the convergence is not uniform in r. Moreover, from Eq. (|9.32[) it follows that 

£ 2( - 1 - j) HLM = ihi + 0(C 2 ), (9-33) 

where (J>L,2j are the lattice moments of Gl{x), defined as in Eq. (|9.6p . and fL^j are the moments of C(x), cf. 
Eq. (l9T9j) . 

(iv) Gl(0) compensates the negative sum and makes \ positive. Moreover, at large £, where 
i ~ exp(27r/3), 

1 1 

(Lai) 5 ~ ^ 

giving rise to a positive contact term in the continuum limit. 

The continuum limit of the correlation function Gl(x) is regular at fixed physical distances and it is given 
by G{x). Also its moments have a regular continuum limit. On the other hand, a singular behavior is found 
at x = consistently with reflection positivity and positivity of \- These features should also characterize 
the Euclidean correlation function of the topological charge density in the continuum limit of lattice QCD. 



Gi(0) ~ -fiTTv? ~ tff> ( 9 - 34 ) 



10 Slow dynamics of topological modes in Monte Carlo simula- 
tions 

Monte Carlo simulations of statistical systems at the critical point and of quantum field theories, such as 
QCD, in the continuum limit are hampered by the problem of critical slowing down (CSD). For a general 
introduction to critical slowing down in Monte Carlo simulations, see, e.g., Ref. [502]. The autocorrelation 
time r, which is related to the number of iterations needed to generate a new independent configuration, 
grows with increasing length scale £. In simulations of lattice QCD and CP N ~ X models, where the upgrading 
methods are essentially local, it has been observed, see e.g. Refs. [124,173,193-195,378], that the topological 
modes show autocorrelation times that are typically much larger than those of other observables not related 
to topology, such as Wilson loops and their correlators. Actually, the heating method [209], used to estimate 
the topological susceptibility, essentially relies on this phenomenon, as discussed in Sec. 15.3.31 The slow 
sampling of the topological charge has been also discussed within MC simulations of full QCD, see e.g. 
Refs. [23,25,85,88,188,506]. 

Recent Monte Carlo simulations [194, 195] of 4D SU (N) lattice gauge theories have provided evidence 
of severe CSD for the topological modes, using a rather standard local overrelaxed upgrading algorithm. 
Indeed, the autocorrelation time T t0 p of the topological charge grows very rapidly with the length scale 
£ = er -1 / 2 , where a is the string tension, exhibiting an apparent exponential behavior r t o P ~ exp(c£) in 
the range of values of £ where data are available, as shown in Fig. 1211 Such a phenomenon worsens with 
increasing N, indeed the constant c appears to increase as c oc N. The worsening of the CSD with increasing 
N may be also related to the suppression of small instantons discussed in Refs. [164,379], see also Sec. 16.21 
A possible mechanism for the change of the topological charge in a MC simulation is that an instanton-like 
structure gradually shrinks until it disappears within the single lattice cell, or appears from it. This process 
requires a significant probability of having small instantons. But this probability is strongly suppressed in 
the large- TV limit, cf. Eq. (|6.6|) . Thus, at fixed length scale, the change of topological charge becomes more 
amd more difficult with increasing N [379], giving rise to a substantial worsening of the corresponding CSD. 

Of course, this behavior does not depend on the particular estimator of the topological charge. This 
peculiar effect has not been observed in plaquette-plaquette or Polyakov line correlations, indicating a small 
effective coupling between topological modes and nontopological ones, such as those determining the confining 
properties. 

These results suggest that the dynamics of the topological modes in Monte Carlo simulations is rather 
different from that of quasi-Gaussian modes. CSD of quasi-Gaussian modes for traditional local algorithms, 
such as those based on standard Metropolis, heat bath and molecular dynamics updatings, is related to an 
approximate random-walk spread of information around the lattice. Thus, the corresponding autocorrelation 
time t is expected to behave as r ~ £ 2 (an independent configuration is obtained when the information travels 
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Figure 21: Autocorrelation times of the topological charge from MC simulations of 4D SU(N) lattice gauge 
theories for N = 3, 4, 6, versus c -1 / 2 where a is the string tension. From Ref. [195]. 



a distance of the order of the correlation length £, and the information is transmitted from a given site/link 
to the nearest neighbors). This guess is correct for Gaussian (free field) models; in general one expects 
that t ~ where z is a dynamical critical exponent, and z sa 2 for quasi-Gaussian modes. On the other 
hand, in the presence of relevant topological modes, the random-walk picture may fail, and therefore we 
may have qualitatively different types of CSD. These modes may give rise to sizeable free-energy barriers 
separating different regions of configuration space. The evolution in configuration space may then present a 
long-time relaxation due to transitions between different topological charge sectors, and the corresponding 
autocorrelation time should behave as r t0 p ~ exp Ff, where Ff, is the typical free-energy barrier among 
different topological sectors. However, this picture remains rather qualitative, because it does not tell us 
how the typical free-energy barriers scale with the correlation length. For example, we may still have a 
power-law behavior if Ff, ~ ln£, or an exponential behavior if Fj, ~ £ s . It is worth mentioning that in 
physical systems, such as random-field Ising systems [249] and glass models [441], the presence of significant 
free-energy barriers in the configuration space causes a very slow dynamics, and an effective separation 
of short-time relaxation within the free-energy basins from long-time relaxation related to the transitions 
between basins. In the case of three-dimensional random-field Ising systems the free-energy barrier picture 
supplemented with scaling arguments leads to the prediction that r ~ exp(c£ s ), where s is a universal critical 
exponent [249]. 

The severe CSD experienced by the topological modes under local updating algorithms is expected to be 
a general feature of Monte Carlo simulations of lattice models with nontrivial topological properties, since 
the mechanism behind this phenomenon should be similar. This has been also observed in two-dimensional 
cp jv-i moc j e i s [124,193]. The numerical study of Ref. [193] for various values of N shows that an exponential 
Ansatz, i.e. T top ~ exp(c£ s ) with s w 1/2, and c cx N, provides a good effective description in the range of 
the correlation length £ where data are available (however, the statistical analysis of the data did not allow 
one to exclude an asymptotic power-law behavior r ~ £ z with z > N/2 setting in at relatively large £). Some 
data for the autocorrelation time of the topological charge are shown in Fig. [22l 

Two remarks are in order. The first one is that, although the effects of the topological CSD have not been 
directly observed in plaquette-plaquette or Polyakov line correlations, the CSD of the topological modes will 
eventually affect those observables as well. The point is that the results of Ref. [192], summarized in Sec. 16.51 
show that the correlators of plaquette operators and topological charge do not vanish at finite N, although 
they are quite small, and therefore there is not a complete decoupling between topological and nontopological 
modes. The strong critical slowing down that is clearly observed in the topological sector will eventually 
affect also the measurements of nontopological quantities, such as those related to the string and glueball 
spectrum. The second remark is that the above CSD is not restricted only to the total topological charge of 
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Figure 22: Autocorrelation times of the topological charge from MC simulations of 2D CP N 1 models. 
From Ref. [193]. 



the lattice configuration, which is the strictly zero-momentum topological mode, but it should affect general 
low-momentum topological modes. For example, it would also affect the topological charge as measured on 
a part of the lattice, a half say. Therefore, it may not be possible to reliably estimate quantities which are 
sensitive to the topological modes when the simulations are not able to sample them, and in particular when 
simulations remain trapped at a fixed topological charge sector. 

The issue of CSD of topological modes has serious implications for Monte Carlo simulations of full QCD. 
It may pose a serious limitation for numerical studies of physical issues related to topological properties, 
such as the mass and the matrix elements of the rf meson, and in general the physics related to the broken 
U(1)a symmetry. Indeed, it is expected to substantially worsen the cost estimates of the dynamical fermion 
simulations for lattice QCD using the actual optimal hybrid Monte Carlo algorithms, see e.g. Refs. [220,297, 
337,386,442]. Note that standard cost estimates of these algorithms, see e.g. Refs. [84,189,337,525], which 
are already quite elevated, have been obtained without taking into account the CSD of topological modes. 
In this respect, it would be important to devise algorithms which sample topological sectors more efficiently. 

The correct sampling of topology becomes increasingly more difficult towards realistic simulations with 
light quarks and sufficiently fine lattices. Standard local simulations, which change the gauge field configu- 
ration by small steps, may get trapped in a fixed topological charge sector, without being able to change it 
during the whole simulation. The question arises whether one can obtain the interesting observables averaged 
over the full 9 vacuum. Of course, physical issues that are particularly sensitive to the topological sampling 
cannot be addressed. However, one should also consider that a fixed finite global topological charge Q is 
equivalent to a boundary condition, thus the effects get suppressed in the thermodynamic V — > oo limit, 
although such finite size effects may be much larger than those observed when Q is correctly sampled, which 
are expected to be suppressed as e _m,ri . However, the quenched study reported in Ref. [261] indicates quite 
a strong dependence of the pseudoscalar correlation function on Q. 

Knowing the 9 dependence of a given observable allows one to estimate the finite-size effects when aver- 
aging at fixed topological charge [43, 108]. For example, let us consider an observable P whose 9 dependence 
around 9 = is given by 



P may be the string tension or the glueball mass in pure gauge theories or a hadron mass in full QCD. The 
average Pq of P at fixed Q is given by [108] 




(10.1) 




(10.2) 
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Therefore, if the calculation over a spacetime volume V is performed at fixed finite Q, the error is order 1/V. 
This formula may be used to correct averages obtained at a fixed topological charge. 

The variations of topological sectors are particularly delicate in Monte Carlo simulations of overlap 
fermions, and in general of Ginsparg- Wilson fermions, because they concide with the nonanalyticity points 
of the Dirac operator, and require a particular numerical effort to be handled [163,250,467]. In this respect, 
the simulation performance is expected to improve by suppressing the occurrence of plaquette values far 
away from the identity, which are essentially responsible for the change of topological sector. Of course, as a 
consequence, the simulation is performed at a fixed topological sector. This can be achieved by appropriately 
choosing the action. Numerical studies of these actions can be found in Refs. [88,255]. 

Simulations at fixed topological charge turn out to be particularly useful in the so-called e regime, where 
£ » L, and the finite-size effects can be analytically computed by an appropriate expansion [267,426]. 
In this regime observablcs are strongly sensitive to the topological sector and predictions exist for each 
sector [167,168,373]. This is important because the low-energy physical quantities of the infinite volume 
limit also appear, and therefore may be estimated by numerical simulations in this regime, see e.g. Ref. [276]. 
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